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(54) Title: MODIFIED PEPTIDES AS THERAPEUTIC AGENTS 
(57) Abstract 

The present invention concerns fusion of Fc domains with biologically active peptides and a process for preparing pharmaceutical 
agents using biologically active peptides. In this invention, pharmacologically active compounds are prepared by a process comprising: a) 
selecting at least one peptide that modulates the activity of a protein of interest; and b) preparing a pharmacologic agent comprising an Fc 
domain covalently linked to at least one amino acid of the selected peptide. Linkage to the vehicle increases the half-life of the peptide, 
which otherwise would be quickly degraded in vivo. The preferred vehicle is an Fc domain. The peptide is preferably selected by phage 
display, E. coli display, ribosome display, RNA-peptide screening, or chemical-peptide screening. 
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Modified Peptides as Therapeutic Agents 
Background of the Invention 

Recombinant proteins are an emerging class of therapeutic agents. 
5 Such recombinant therapeutics have engendered advances in protein 
formulation and chemical modification. Such modifications can protect 
therapeutic proteins, primarily by blocking their exposure to proteolytic 
enzymes. Protein modifications may also increase the therapeutic 
protein's stability, circulation time, and biological activity. A review 

1 0 article describing protein modification and fusion proteins is Francis 
(1992), Focus on Growth Factors 3:4-10 (Mediscript, London), which is 
hereby incorporated by reference. 

One useful modification is combination with the "Fc" domain of an 
antibody. Antibodies comprise two functionally independent parts, a 

15 variable domain known as "Fab", which binds antigen, and a constant 
domain known as "Fc", which links to such effector functions as 
complement activation and attack by phagocytic cells. An Fc has a long 
serum half-life, whereas an Fab is short-lived. Capon et al. (1989), Nature 
337: 525-31. When constructed together with a therapeutic protein, an Fc 

2 0 domain can provide longer half-life or incorporate such functions as Fc 
receptor binding, protein A binding, complement fixation and perhaps 
even placental transfer. Id, Table 1 summarizes use of Fc fusions known in 
the art. 
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Table 1— Fc fusion with therapeutic proteins 



Form of Fc 


Fusion 
partner 


Therapeutic 
implications 


Reference 


igGi 


N-terminus of 
CD30-L 


Hodgkin's disease; 
anaplastic lymphoma; T- 
cell leukemia 


U.S. Patent No. 
5,480,981 


Murine Fcy2a 


IL-10 


anti-inflammatory; 
transplant rejection 


Zheng et (1995), *L 
Immunol. 154:5590-600 


lgG1 


TNF receptor 


septic shock 


Fisher fiLaL (1996), tL 
EnaLJ. Med. 334: 1697- 
1702; Van Zee, K. et gl. 
(1996). J. Immunol. 156: 
2221-30 


igG, IgA, 
IgM, or IgE 
(excluding 
the first 
domain) 


TNF receptor 


inflammation, autoimmune 
disorders 


U.S. Pat. No. 5,808,029, 
issued September 15, 
1998 


lgG1 


CD4 receptor 


AIDS 


Capon eLaL (1989), 
Nature 337: 525-31 


IgGi. 

lpG3 


N-terminus 
of IL-2 


anti-cancer, antiviral 


Harvill et aL (1995), 
Immunotech. 1: 95-105 


igGi 


C-terminus of 
OPG 


osteoarthritis; 
bone density 


WO 97/23614, published 
July 3, 1997 


IgGi 


N-terminus of 
leptin 


anti-obesity 


PCT/US 97/23183, filed 
December 11, 1997 


Human Ig 
Cy1 


CTLA-4 


autoimmune disorders 


Unsley (1991), JLExb, 
Med. 174:561-9 



A much different approach to development of therapeutic agents is 
peptide library screening. The interaction of a protein ligand with its 
5 receptor often takes place at a relatively large interface. However, as 

demonstrated for human growth hormone and its receptor, only a few key 
residues at the interface contribute to most of the binding energy. 
Clackson etal. (1995), Science 267: 383-6. The bulk of the protein ligand 
merely displays the binding epitopes in the right topology or serves 
1 0 functions unrelated to binding. Thus, molecules of only "peptide" length 
(2 to 40 amino acids) can bind to the receptor protein of a given large 
protein ligand. Such peptides may mimic the bioactivity oT the large 
protein ligand ("peptide agonists") or, through competitive binding, 
inhibit the bioactivity of the large protein ligand ("peptide antagonists"). 
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Phage display peptide libraries have emerged as a powerful 
method in identifying such peptide agonists and antagonists. See, for 
example, Scott etal. (1990), Science 249: 386; Devlin etal. (1990), Science 
249: 404; U.S. Pat. No. 5,223,409, issued June 29, 1993; U.S. Pat. No. 
5 5,733,731, issued March 31, 1998; U.S. Pat. No. 5,498,530, issued March 12, 
1996; U.S. Pat. No. 5,432,018, issued July 11, 1995; U.S. Pat. No. 5338,665, 
issued August 16, 1994; U.S. Pat. No. 5,922,545, issued July 13, 1999; WO 
96/40987, published December 19, 1996; and WO 98/15833, published 
April 16, 1998 (each of which is incorporated by reference). In such 

1 0 libraries, random peptide sequences are displayed by fusion with coat 
proteins of filamentous phage. Typically, the displayed peptides are 
affinity-eluted against an antibody-immobilized extracellular domain of a 
receptor. The retained phages may be enriched by successive rounds of 
affinity purification and repropagation. The best binding peptides may be 

1 5 sequenced to identify key residues within one or more structurally related 
families of peptides. See, e.g., Cwirla et al. (1997), Science 276: 1696-9, in 
which two distinct families were identified. The peptide sequences may 
also suggest which residues may be safely replaced by alanine scanning or 
by mutagenesis at the DNA level. Mutagenesis libraries may be created 

2 0 and screened to further optimize the sequence of the best binders. 
Lowman (1997), Ann. Rev. Biophvs. Biomol. Struct. 26: 401-24. 

Structural analysis of protein-protein interaction may also be used 
to suggest peptides that mimic the binding activity of large protein 
ligands. In such an analysis, the crystal structure may suggest the identity 

2 5 and relative orientation of critical residues of the large protein ligand, 
from which a peptide may be designed. See, e.g., Takasaki etal. (1997), 
Nature Biotech. 15: 1266-70. These analytical methods may-also.be used to „ 
investigate the interaction between a receptor protein and peptides 



3 
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selected by phage display, which may suggest further modification of the 
peptides to increase binding affinity. 

Other methods compete with phage display in peptide research. A 
peptide library can be fused to the carboxyl terminus of the lac repressor 
5 and expressed in E. coli . Another E. coli -based method allows display on 
the cell's outer membrane by fusion with a peptidoglycan-associated 
lipoprotein (PAL). Hereinafter, these and related methods are collectively 
referred to as " E. coli display." In another method, translation of random 
RNA is halted prior to ribosome release, resulting in a library of 
1 0 polypeptides with their associated RNA still attached. Hereinafter, this 
and related methods are collectively referred to as "ribosome display." 
Other methods employ chemical linkage of peptides to RNA; see, for 
example, Roberts & Szostak (1997), Proc. Natl. Acad. Sci. USA, 94: 12297- 
303. Hereinafter, this and related methods are collectively referred to as 
1 5 "RNA-peptide screening." Chemically derived peptide libraries have been 
developed in which peptides are immobilized on stable, non-biological 
materials, such as polyethylene rods or solvent-permeable resins. Another 
chemically derived peptide library uses photolithography to scan peptides 
immobilized on glass slides. Hereinafter, these and related methods are 
2 0 collectively referred to as "chemical-peptide screening." Chemical-peptide 
screening may be advantageous in that it allows use of D-amino acids and 
other unnatural analogues, as well as non-peptide elements. Both 
biological and chemical methods are reviewed in Wells & Lowman (1992), 
Curr. Opin. Biotechnol. 3: 355-62. 
2 5 Conceptually, one may discover peptide mimetics of any protein 

using phage display and the other methods mentioned above. These 
methods have been used for epitope mapping, for identification of critical . 
amino acids in protein-protein interactions, and as leads for the discovery 
of new therapeutic agents. E.g., Cortese etaL (1996), Curr. Opin. Biotech. 7: 
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616-21. Peptide libraries are now being used most often in immunological 
studies, such as epitope mapping. Kreeger (1996), The Scientist 10(13): 19- 
20. 

Of particular interest here is use of peptide libraries and other 
techniques in the discovery of pharmacologically active peptides. A 
number of such peptides identified in the art are summarized in Table 2. 
The peptides are described in the listed publications, each of which is 
hereby incorporated by reference. The pharmacologic activity of the 
peptides is described, and in many instances is followed by a shorthand 
term therefor in parentheses. Some of these peptides have been modified 
(e.g., to form C-terminally cross-linked dimers). Typically, peptide 
libraries were screened for binding to a receptor for a pharmacologically 
active protein (e.g., EPO receptor). In at least one instance (CTLA4), the 
peptide library was screened for binding to a monclonal antibody. 
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Ta ble 2— Pharmacologically active peptides 



Form of 
peptide 


Binding 
partner/ 
protein of 


Pharmacologic 
activity 


Reference 


intrapeptide 
disulfide- 
bonded 


EPO receptor 


EPO-mimetic 


Wrighton eLal. (1996), 
Science 273: 458-63; 
U.S. Pat. No. 5,773,569, 
issued June 30, 1998 to 
Wriqhton et al. 


C-terminally 
cross-linked 
dimer 


EPO receptor 


EPO-mimetic 


Uvnah eLal. (1996), 
Science 273: 464-71; 
Wrighton et al. (1997), 
Natl 1 "* Biotechnoloov 15: 
1261-5; International 
patent application WO 
96/40772, published 
Dec. 19, 1996 


linear 


EPO receptor 


EPO-mimetic 


NarandaeLal- (1999), 
p r n<v Natl. Acad. Sci. 
USA. 96: 7569-74 


linear 


c-MpI 


1 rU-mimeuc 


Cwirla et al (1997) 
Science 276: 1696-9; 
U.S. Pat. No. 5,869,451 . 
issued Feb. 9, 1999; U.S. 
Pat. No. 5,932,946, 
issued Aua. 3. 1999 


C-terminally 
cross^linked 
dimer 


c-MpI 


TPO-mimetic 


Cwiriaelal- (1997), 
Science 276: 1696-9 


disulfide- 
linked dimer 




Stimulation oi 

hematopoiesis 
("G-CSF-mimetic") 


Paukovits et al. (1984), 
Hnppe-SeYlers Z, 
Physiol. Chem. 365: 303- 
11;Laerum e±ai. (1988), 
Exp. Hemat. 16: 274-80 


aiKyiene- 
linked dimer 




G-CSF-mimetic 


Bhatnagarelai. (1996), 
.1 Med. Chem. 39: 3814- 
9; Cuthbertson etal. 
(1QQ7) .1 Mfid.Chem. 
40: 2876-82; King eLal. 

Fvp. Hematol. 
19:481; King eLal- 
(1 995), BiQQSi 86 (Suppl. 
1): 309a 


linear 


IL-1 receptor 


inflammatory and 
autoimmune diseases 
("IL-1 antagonist" or 
u IL-1ra-mimetic n ) 


U.S. Pat. No. 5,608,035; 
U.S.Pat. No. 5,786,331; 
U.S-Pat. No. 5,880,096; 
Yanofsky eLal- (1996), 



4 The protein listed in this column may be bound by the associated peptide (ej.. EPO 
receptor IL-1 receptor) or mimicked by the associated pept.de. The references hsted for 
each Krify 2 the molecule is bound by or mimicked by the pept.des. 
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Pmc. Natl. Acad. Sci. 93: 
7381-6; Akeson gLal- 
(1996). J. Biol. Chem . 
271:30517-23; 
WiekzorekfiLal- (1997), 
Pnl. J. Pharmacol. 49: 



107-17; Yanofsky (1996), 
PNAs. 93:7381-7386. 



linear 


Facteur 
thymique 
serique (FTS) 


stimulation of 
lymphocytes 
(TTS-mimetic") 


Inagaki-Ohara et al. 
(1996). Cellular Immunol. 
171:30-40; Yoshida 
(1984), Int. J. 
ImmunoDharmacol. 
6:141 -o. 


intrapeptide 
disulfide 
bonded 


CTLA4 MAb 


CTLA4-mimetic 


Fukumoto et al- (1998), 
Nature Biotech. 16:267- 
70 


exocyclic 


TNF-a receptor 


TNF-a antagonist 


Takasaki eLaJ. (1997), 
Nature Biotech. 15:1266- 
70; WO 98/53842, 
published December 3, 
1998 


linear 


TNF-a receptor 


TNF-a antagonist 


Chirinos-Rojas ( ), *L 
Imm.. 5621-5626. 


intrapeptide 
disulfide 
bonded 


C3b 


inhibition of complement 
activation; autoimmune 
diseases 
("C3b-antagonist") 


SahufiLal. (1996), Jl 
Immunol. 157: 884-91; 
Morikis et al. (1998), 
Protein Sci. 7: 619-27 


linear 

• 


vinculin 


cell adhesion processes — 
cell growth, differentiation, 
wound healing, tumor 

matactocic f**\/inoi il!n 
lIlCldolcLolo ^ VIMUUIIM 

binding") 


Adey et al. (1997), 
Biochem. J. 324: 523-8 


linear 


C4 binding 
protein (C4BP) 


anti-thrombotic 


Linse et al. (1997), Jl 
Bio!. Chem. 272: 14658- 
65 


linear 


urokinase 
receptor 


processes associated with 
urokinase interaction with 

its receptor (e.g., 
angiogenesis, tumor cell 
invasion and metastasis); 
rUKR antagonist") 


GoodsoneLai. (1994), 
Proc. Natl. Acad. Sci. 91: 
7129-33; International 
application WO 
97/35969, published 
October 2, 1997 


linear 


Mdm2, Hdm2 


Inhibition of inactivation of 
p53 mediated by Mdm2 or 

hdm2; anti-tumor 
("Mdm/hdm antagonist") 


Picksley fiLal- (1994), 
Oncogene 9: 2523-9: 
BottgereLal. (1997) J, 
Mol. Biol. 269: 744-56; 
BottgereJLa!. (1996), 
Oncoaene 13:2141-7 


linear 


P 21 WAF1 * 


anti-tumor by mimicking 
the activity of p21 WAF1 


-BaH-fiLal- (1997), Curr. 
Biol. 7: 71-80 


linear 


farnesyl 


anti-cancer by preventing 


Gibbs et al. (1994), Cell 



b FTS is a thymic hormone mimicked by the molecule of this invention rather than a 
receptor bound by the molecule of this invention. 

1 
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linear 



linear 



linear 



linear 



transferase 
Ras effector 
domain 



SH2/SH3 
domains 



P 16" 



Src, Lyn 



activation of ras oncogene 
anti-cancer by inhibiting 
biological function of the 
ras oncogene 

anti-cancer by inhibiting 

tumor growth with 
activated tyrosine kinases 

anti-cancer by mimicking 
activity of p16; e.g., 
inhibiting cyclin D-Cdk 
rnmplex ("p1 6-mimetic"' 
inhibition of Mast cell 
activation, IgE-related 

conditions, type I 
hypersensitivity ("Mast 



77:175-178 
Moodie et al. (1994), 
Tigris Genet 1 0: 44-48 
Rodriguez et al. (1994), 
Nature 370:527-532 
Pawson et al (1993), 
Curr. Biol. 3:434-432 
Yu et al. (1994), Cell 

76:933-945 

F&hraeus fiLal. (1996), 
Curr. Biol . 6:84-91 



StauffereLal (1997), 
piochem . 36: 9388-94 



linear 


Mast cell 
protease 


Lit? II CU ILCl^vjl iwi / 

treatment of inflammatory 
disorders mediated by 
release of tryptase-6 
("Mast cell protease 
inhibitors") 


International application 
WO 98/33812, published 
August 6, 1998 


III IGCM 


SH3 domains 


treatment of SH3- 
mediated disease states 
("SH3 antagonist") 


Rickles eLal (1994), 
EMBO J. 13: 5598-OOU4, 
Sparks fiLal. (1994), »L 
Biol. Chem. 269: 23853- 
6; Sparks fiLal- (1996), 
prnr Natl, Arad. Sci. 93: 
1540-4 


linear 


HBV core 
antigen (HBcAg) 


treatment of HBV viral 
infections ("anti-HBV") 


Dyson & Muray (1995), 
Pfn^, M»« Ar.ad.Sci. 92: 
2194-8 


linear 


selectins 


neutrophil adhesion; 
inflammatory diseases 
("selectin antagonist") 


Martens fiLal- (1 995), J* 
Biol. Chem. 270: 21129- 
36; European patent 
application EP0 714 
912, published June 5, 
1996 


linear, 
cyclized 


calmodulin 


calmodulin antagonist 


Pierce fiLal. (1995), 
Molpn Diversity 1:259- 
65; Dedman eLal. 
{ioa3),,f, Biol. Chem. 
268: 23025-30; Adey & 
Kay (1996), Qsns. 169: 
133-4 


linear, 
cyclized- 


integrins 


tumor-homing; treatment 
for conditions related to 
' integrin-mediated cellular 
events, including platelet 
aggregation, thrombosis, 
wound healing, 
osteoporosis, tissue 
ropair. anqioqenesis (e.g., 


International applications 
WO 95/14714, published 
June 1,1 995; WO 
97/08203, published 
March 6, 1997; WO 
98/10795, published 
March 19, 1998; WO 
99/24462. published May 
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for treatment of cancer), 
and tumor invasion 
("inteqrin-bindina") 


20, 1999; Kraft eLaL 
(1999), J. Biol. Chem. 
274:1979-1985 


cyclic, linear 


fibronectin and 
extracellular 
matrix 
componenio ui * 
cells and 
macrophages 


treatment of inflammatory 
and autoimmune 
conditions 


WO 98/09985, published 
March 12, 1998 


linear 


ot net si tin 

and cortistatin 


treatment or prevention of 
hormone-producing 
tumors, acromegaly, 
giantism, dementia, 
gastric ulcer, tumor 
growth, inhibition of 
hormone secretion, 
modulation of sleep or 
neural activity 


European patent 
application 0 911 393, 
published April 28, 1999 


linear 


uacienai 
lipopolysac- 
cnanuc 


antibiotic; septic shock; 
disorders modulatable by 
CAP37 


U.S. Pat. No. 5,877,151, 
issued March 2, 1999 


linear or 
cyclic, 
including D- 
amino acids 


pardaxin, mellitin 


antipathogenic 


WO 97/31 01 9, published 
28 August 1997 


linear, cyclic 


\/ip 

V li 


impotence, 
neurodegenerative 
disorders 


WO 97/40070, published 
October 30, 1997 


linear 


CTLs 


cancer 


EP 0 770 624, published 
Mav2, 1997 


linear 






Burnstein (1988), 
Biochem.. 27:4066-71. 


linear 


Amylin 




Cooper (1987),££Q£ i 
84:8628-32. 


linear 


Adrenomedullin 




Kitamura (i yyo), opnv. 
192:553-60. 


cyclic, linear 

• 




anti-anaioaenic; cancer, 
rheumatoid arthritis, 
diabetic retinopathy, 
psoriasis fVEGF 
antaqonist") 


Fairbrother (1998), 
Biochem.. 37:17754- 
17764. 


cyclic 


MMP 


inflammation and 
autoimmune disorders; 
tumor growth 
f'MMP inhibitor") 


Koivunen (1999), Nature 
£|QtBS&., 17:768-774. 

U.S. Pat. No. 5.869.452 




HGH fragment 
Echistatin 


inhibition of platelet 
aaareaation 


Gan (1988), J, Biol, 
Chem.. 263:19827-32. .. 


linear 


SLE 
autoantibody 


SLE 


WO 96/30057, published 
October 3. 1996 




GD1 alpha 


suppression ot tumor Ishikawa { i a yoj , 
metaste-* FFBS Lett. 441 (1 ): 20-4 
" endothelial cell activation , Blank ei_al. (1 999), OQ& 




antiphospholipid 


9 
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beta-2- 
glycoprotein-l 
(P2GPI) 
antibodies 



N^l AraHSci.USA96: 
5164-8 




Peptides identified by peptide library screening have been regarded 
as "leads" in development of therapeutic agents rather than as therapeutic 
agents themselves. Like other proteins and peptides, they would be 
5 rapidly removed in vivo either by renal filtration, cellular clearance 

mechanisms in the reticuloendothelial system, or proteolytic degradation. 

Francis (1992), " « ™ ^mwth Factors 3: 4-11. As a result, the art 

presently uses the identified peptides to validate drug targets or as 
scaffolds for design of organic compounds that might not have been as 
1 o easily or as quickly identified through chemical library screening. 

Lowman (1997), a™ *pv. Biophvs. Biomol. S truct. 26: 401-24; Kay etal- 
(1998), r>m ff Disc. Today 3: 370-8. The art would benefit from a process by 
which such peptides could more readily yield therapeutic agents. 

Summary of the Invention 
15 The present invention concerns a process by which the invivo half- 

life of one or more biologically active peptides is increased by fusion with 
a vehicle. In this invention, pharmacologically active compounds are 
prepared by a process comprising: 

a) selecting at least one peptide that modulates the activity of a 
20 protein of interest; and 

b) preparing a pharmacologic agent comprising at least one 
vehicle covalently linked to at least one amino acid sequence 
of the selected peptide. 

The preferred vehicle is an Fc domain. The peptides screened in step (a) 
2 5 are preferably expressed in a phage display library. The vehicle and the 
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peptide may be linked through the N- or C-terminus of the peptide or the 
vehicle, as described further below. Derivatives of the above compounds 
(described below) are also encompassed by this invention. 

The compounds of this invention may be prepared by standard 
synthetic methods, recombinant DNA techniques, or any other methods of 
preparing peptides and fusion proteins. Compounds of this invention that 
encompass non-peptide portions may be synthesized by standard organic 
chemistry reactions, in addition to standard peptide chemistry reactions 

when applicable. 

The primary use contemplated is as therapeutic or prophylactic 
agents. The vehicle-linked peptide may have activity comparable to— or 
even greater than— the natural ligand mimicked by the peptide. In 
addition, certain natural ligand-based therapeutic agents might induce 
antibodies against the patient's own endogenous ligand; the vehicle-linked 
1 5 peptide avoids this pitfall by having little or typically no sequence identity 

with the natural ligand. 

Although mostly contemplated as therapeutic agents, compounds 
of this invention may also be useful in screening for such agents. For 
example, one could use an Fc-peptide (e.g., Fc-SH2 domain peptide) in an 
2 0 assay employing anti-Fc coated plates. The vehicle, especially Fc, may 
make insoluble peptides soluble and thus useful in a number of assays. 

The compounds of this invention may be used for therapeutic or 
prophylactic purposes by formulating them with appropriate 
pharmaceutical carrier materials and administering an effective amount to 
25 a patient, such as a human (or other mammal) in need thereof. Other 
related aspects are also included in the instant invention. 

Numerous additional aspects and advantages of the present 
invention will become apparent upon consideration of the figures and 
detailed description of the invention. 

II 
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Brief Description of the Figures 
Figure 1 shows a schematic representation of an exemplary process 
of the invention. In this preferred process, the vehicle is an Fc domain, 
which is linked to the peptide covalently by expression from a DNA 
5 construct encoding both the Fc domain and the peptide. As noted in 
Figure 1, the Fc domains spontaneously form a dimer in this process. 

Figure 2 shows exemplary Fc dimers that may be derived from an 
IgGl antibody. "Fc" in the figure represents any of the Fc variants within 
the meaning of "Fc domain" herein. "X 1 " and "X 2 " represent peptides or 
1 o linker-peptide combinations as defined hereinafter. The specific dimers are 
as follows: 

A, D: Single disulfide-bonded dimers. IgGl antibodies typically 
have two disulfide bonds at the hinge region between the constant and 
variable domains. The Fc domain in Figures 2A and 2 D may be formed by 

1 5 truncation between the two disulfide bond sites or by substitution of a 
cysteinyl residue with an unreactive residue (e.g., alanyl). In Figure 2A, 
the Fc domain is linked at the amino terminus of the peptides; in 2D, at the 

carboxyl terminus. 

B, E: Doubly disulfide-bonded dimers. This Fc domain may be 
2 0 formed by truncation of the parent antibody to retain both cysteinyl 

residues in the Fc domain chains or by expression from a construct 
including a sequence encoding such an Fc domain. In Figure 2B, the Fc 
domain is linked at the amino terminus of the peptides; in 2E, at the 

carboxyl terminus. 
2 5 C, F: Noncovalent dimers. This Fc domain may be formed by 

eUmination of the cysteinyl residues by either truncation or substitution. 
One may desire to eliminate the cysteinyl residues to avoid impurities 
formed by reaction of the cysteinyl residue with cysteinyl residues of other 
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proteins present in the host cell. The noncovalent bonding of the Fc 
domains is sufficient to hold together the dimer. 

Other dimers may be formed by using Fc domains derived from different 
types of antibodies (e.g., IgG2, IgM). 
5 Figure 3 shows the structure of preferred compounds of the 

invention that feature tandem repeats of the pharmacologically active 
peptide. Figure 3 A shows a single chain molecule and may also represent 
the DNA construct for the molecule. Figure 3B shows a dimer in which the 
linker-peptide portion is present on only one chain of the dimer. Figure 3C 

1 0 shows a dimer having the peptide portion on both chains. The dimer of 

Figure 3C will form spontaneously in certain host cells upon expression of 
a DNA construct encoding the single chain shown in Figure 3A. In other 
host cells, the cells could be placed in conditions favoring formation of 
dimers or the dimers can be formed in vitro . 

1 5 Figure 4 shows exemplary nucleic acid and amino acid sequences 

(SEQ ID NOS: 1 and 2, respectively) of human IgGl Fc that may be used in 
this invention. 

Figure 5 shows a synthetic scheme for the preparation of PEGylated 

peptide 19 (SEQ ID NO: 3). 
2 0 Figure 6 shows a synthetic scheme for the preparation of PEGylated 

peptide 20 (SEQ ID NO: 4). 

Figure 7 shows the nucleotide and amino acid sequences (SEQ ID 
NOS: 5 and 6, respectively) of the molecule identified as "Fc-TMP" in 
Example 2 hereinafter. 
2 5 Figure 8 shows the nucleotide and amino acid sequences (SEQ. ID. 

NOS: 7 and 8, respectively) of the molecule identified as "Fc-TMP-TMP" in 
Example 2 hereinafter. " 
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Figure 9 shows the nucleotide and amino acid sequences (SEQ. ID. 
NOS: 9 and 10, respectively) of the molecule identified as "TMP-TMP-Fc" 
in Example 2 hereinafter. 

Figure 10 shows the nucleotide and amino acid sequences (SEQ. ID. 
5 NOS: 11 and 12, respectively) of the molecule identified as "TMP-Fc" in 
Example 2 hereinafter. 

Figure 11 shows the number of platelets generated in vivo in 
normal female BDF1 mice treated with one 100 ug/kg bolus injection of 
various compounds, with the terms defined as follows. 

1 o PEG-MGDF: 20 kD average molecular weight PEG attached by 

reductive amination to the N-terminal amino group of amino 

acids 1-163 of native human TPO, which is expressed in E. coli 

(so that it is not glycosylated); 
TMP: the TPO-mimetic peptide having the amino acid sequence 
1 5 IEGPTLRQWLAARA (SEQ ID NO: 13); 

TMP-TMP: the TPO-mimetic peptide having the amino acid 

sequence IEGPTLRQWLAARA-GGGGGGGG- 

IEGPTLRQWLAARA (SEQ ID NO: 14); 
PEG-TMP-TMP: the peptide of SEQ ID NO: 14, wherein the PEG 

2 o group is a 5 kD average molecular weight PEG attached as 

shown in Figure 6; 
Fc-TMP-TMP: the compound of SEQ ID NO: 8 (Figure 8) dimerized 
with an identical second monomer (i.e., Cys residues 7 and 10 
are bound to the corresponding Cys residues in the second 
2 5 monomer to form a dimer, as shown in Figure 2); and 

TMP-TMP-Fc is the compound of SEQ ID NO: 10 (Figure 9) 

dimerized in the same way as TMP-TMP-Fc except that the Fc 
domain is attached at the C-terminal end rather than the N- 
terminal end of the TMP-TMP peptide. 
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Figure 12 shows the number of platelets generated in vivo in 
normal BDF1 mice treated with various compounds delivered via 
implanted osmotic pumps over a 7-day period. The compounds are as 
defined for Figure 7. 
5 Figure 13 shows the nucleotide and amino acid sequences (SEQ. ID. 

NOS: 15 and 16, respectively) of the molecule identified as "Fc-EMP" in 
Example 3 hereinafter. 

Figure 14 shows the nucleotide and amino acid sequences (SEQ ID 
NOS: 17 and 18, respectively) of the molecule identified as "EMP-Fc" in 
10 Example 3 hereinafter. 

Figure 15 shows the nucleotide and amino acid sequences (SEQ ID 
NOS:19 and 20, respectively) of the molecule identified as "EMP-EMP-Fc" 
in Example 3 hereinafter. 

Figure 16 shows the nucleotide and amino acid sequences (SEQ ID 
1 5 NOS: 21 and 22, respectively) of the molecule identified as "Fc-EMP-EMP" 
in Example 3 hereinafter. 

Figures 17A and 17B show the DNA sequence (SEQ ID NO: 23) 
inserted into pCFM1656 between the unique AatE (position #4364 in 
pCFM1656) and SacH (position #4585 in pCFM1656) restriction sites to 
2 0 form expression plasmid pAMG21 (ATCC accession no. 98113). 

Figure 18A shows the hemoglobin, red blood cells, and hematocrit 
generated in vivo in normal female BDF1 mice treated with one 100 ug/kg 
bolus injection of various compounds. Figure 18B shows the same results 
with mice treated with 100 ug/kg per day delivered ihcaamc doae by 7- 
2 5 day micro-osmotic pump with the EMPs delivered at 100 ug/kg, rhEPO at 
30U/mouse. (In both experiments, neutrophils, lymphocytes, and platelets 
were unaffected.) In these figures, the terms are defined as follows. 

Fc-EMP: the compound of SEQ ID NO: 16 (Figure 13) dimerized 
with an identical second monomer (i.e., Cys residues 7 and 10 are 

/Sr 
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bound to the corresponding Cys residues in the second monomer to 
form a dimer, as shown in Figure 2); 

EMP-Fc: the compound of SEQ ID NO: 18 (Figure 14) dimerized in 
the same way as Fc-EMP except that the Fc domain is attached at 
5 the C-terminal end rather than the N-terminal end of the EMP 

peptide. 

"EMP-EMP-Fc" refers to a tandem repeat of the same peptide (SEQ 
ID NO: 20) attached to the same Fc domain by the carboxyl 
terminus of the peptides. "Fc-EMP-EMP" refers to the same tandem 
0 repeat of the peptide but with the same Fc domain attached at the 

amino terminus of the tandem repeat. All molecules are expressed 
in E. coli and so are not glycosylated. 

Figures 19A and 19B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1055 and 1056) of the Fc-TNF-cc inhibitor fusion molecule 
5 described in Example 4 hereinafter. 

Figures 20A and 20B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1057 and 1058) of the TNF-a inhibitor-Fc fusion molecule 
described in Example 4 hereinafter. 

Figures 21 A and 21B show the nucleotide and amino acid sequences 
0 (SEQ ID NOS: 1059 and 1060) of the Fc-IL-1 antagonist fusion molecule 
described in Example 5 hereinafter. 

Figures 22A and 22B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1061 and 1062) of the IL-1 antagonist-Fc fusion molecule 
described in Example 5 hereinafter. 
5 Figures 23 A, 23B, and 23C show the nucleotide and amino acid 

sequences (SEQ ID NOS: 1063 and 1064) of the Fc-VEGF antagonist fusion 
molecule described in Example 6 hereinafter. 



WO 00/24782 



PCT/US99/25044 



Figures 24A and 24B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1065 and 1066) of the VEGF antagonist-Fc fusion molecule 
described in Example 6 hereinafter. 

Figures 25 A and 25B show the nucleotide and amino acid sequences 
5 (SEQ ID NOS: 1067 and 1068) of the Fc-MMP inhibitor fusion molecule 
described in Example 7 hereinafter. 

Figures 26A and 26B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1069 and 1070) of the MMP inhibitor-Fc fusion molecule 
described in Example 7 hereinafter. 

1 o Detailed Description of the Invention 

Definition of Terms 

The terms used throughout this specification are defined as follows, 
unless otherwise limited in specific instances. 

The term "comprising" means that a compound may include 
1 5 additional amino acids on either or both of the N- or C- termini of the 
given sequence. Of course, these additional amino acids should not 
significantly interfere with the activity of the compound. 

The term "vehicle" refers to a molecule that prevents degradation 
and/or increases half-life, reduces toxicity, reduces immunogenicity, or 

2 0 increases biological activity of a therapeutic protein. Exemplary vehicles 

include an Fc domain (which is preferred) as well as a linear polymer (e.g., 
polyethylene glycol (PEG), polylysine, dextran, etc.); a branched-chain 
polymer (see, for example, U.S. Patent No. 4,289,872 to Denkenwalter et 
al v issued September 15, 1981; 5,229,490 to Tarn, issued July 20, 1993; WO 
2 5 93/21259 by Frechet etal., published 28 October 1993); a lipid; a 

cholesterol group (such as a steroid); a carbohydrate or oligosaccharide; or 
any natural or synthetic protein, polypeptide or peptide that binds to a 
salvage receptor. Vehicles are further described hereinafter. 

I? 
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The term "native Fc" refers to molecule or sequence comprising the 
sequence of a non-antigen-binding fragment resulting from digestion of 
whole antibody, whether in monomeric or multimeric form. The original 
immunoglobulin source of the native Fc is preferably of human origin and 
5 may be any of the immunoglobulins, although IgGl and IgG2 are 

preferred. Native Fc's are made up of monomeric polypeptides that may 
be linked into dimeric or multimeric forms by covalent (i.e., disulfide 
bonds) and non-covalent association. The number of intermolecular 
disulfide bonds between monomeric subunits of native Fc molecules 

1 0 ranges from 1 to 4 depending on class (e.g., IgG, IgA, IgE) or subclass (e.g., 
IgGl, IgG2, IgG3, IgAl, IgGA2). One example of a native Fc is a disulfide- 
bonded dimer resulting from papain digestion of an IgG (see Ellison etal. 
(1982), Nucleic Acids Res . 10: 4071-9). The term "native Fc" as used herein 
is generic to the monomeric, dimeric, and multimeric forms. 

1 5 The term "Fc variant" refers to a molecule or sequence that is 

modified from a native Fc but still comprises a binding site for the salvage 
receptor, FcRn. International applications WO 97/34631 (published 25 
September 1997) and WO 96/32478 describe exemplary Fc variants, as 
well as interaction with the salvage receptor, and are hereby incorporated 

20 by reference. Thus, the term "Fc variant" comprises a molecule or 

sequence that is humanized from a non-human native Fc. Furthermore, a 
native Fc comprises sites that may be removed because they provide 
structural features or biological activity that are not required for the fusion 
molecules of the present invention. Thus, the term "Fc variant" comprises 

25 a molecule or sequence that lacks one or more native Fc sites or residues 
that affect or are involved in (1) disulfide bond formation, (2) 
incompatibility with a selected host cell (3) N-terminal heterogeneity upon .. 
expression in a selected host cell, (4) glycosylation, (5) interaction with 
complement, (6) binding to an Fc receptor other than a salvage receptor, or 

1ST 
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(7) antibody-dependent cellular cytotoxicity (ADCC). Fc variants are 
described in further detail hereinafter. 

The term "Fc domain" encompasses native Fc and Fc variant 
molecules and sequences as defined above. As with Fc variants and native 
5 Fc's, the term "Fc domain" includes molecules in monomeric or 

multimeric form, whether digested from whole antibody or produced by 
other means. 

The term "multimer" as applied to Fc domains or molecules 
comprising Fc domains refers to molecules having two or more 

1 0 polypeptide chains associated covalently, noncovalently, or by both 
covalent and non-covalent interactions. IgG molecules typically form 
dimers; IgM, pentamers; IgD, dimers; and IgA, monomers, dimers, 
trimers, or tetramers. Multimers may be formed by exploiting the 
sequence and resulting activity of the native Ig source of the Fc or by 

15 derivatizing (as defined below) such a native Fc. 

The term "dimer" as applied to Fc domains or molecules 
comprising Fc domains refers to molecules having two polypeptide chains 
associated covalently or non-covalently. Thus, exemplary dimers within 
the scope of this invention are as shown in Figure 2. 

2 0 The terms "derivatizing" and "derivative" or "derivatized" 

comprise processes and resulting compounds respectively in which (1) the 
compound has a cyclic portion; for example, cross-linking between 
cysteinyl residues within the compound; (2) the compound is cross-linked 
or has a cross-linking site; for example, the compound has a cysteinyl 

2 5 residue and thus forms cross-linked dimers in culture or in vivo; (3) one or 
more peptidyl linkage is replaced by a non-peptidyl linkage; (4) the N- 
termiims is replaced by -NRR 1 , NRaCOR 1 , -NRC(0)OR^NRS(0) 2 R\ - 
NHC(0)NHR, a succinimide group, or substituted or unsubstituted 
benzyloxycarbonyl-NH-, wherein R and R 1 and the ring substituents are 
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as defined hereinafter; (5) the C-terminus is replaced by -C(0)R 2 or -NR 3 R 4 
wherein R 2 , R 3 and R 4 are as defined hereinafter; and (6) compounds in 
which individual amino acid moieties are modified through treatment 
with agents capable of reacting with selected side chains or terminal 
5 residues. Derivatives are further described hereinafter. 

The term "peptide" refers to molecules of 2 to 40 amino acids, with 
molecules of 3 to 20 amino acids preferred and those of 6 to 15 amino acids 
most preferred. Exemplary peptides may be randomly generated by any 
of the methods cited above, carried in a peptide library (e.g., a phage 

1 0 display library), or derived by digestion of proteins. 

The term "randomized" as used to refer to peptide sequences refers 
to fully random sequences (e.g., selected by phage display methods) and 
sequences in which one or more residues of a naturally occurring molecule 
is replaced by an amino acid residue not appearing in that position in the 

1 5 naturally occurring molecule. Exemplary methods for identifying peptide 
sequences include phage display, E. coli display, ribosome display, RNA- 
peptide screening, chemical screening, and the like. 

The term "pharmacologically active" means that a substance so 
described is determined to have activity that affects a medical parameter 

2 0 (e.g., blood pressure, blood cell count, cholesterol level) or disease state 
(e.g., cancer, autoimmune disorders). Thus, pharmacologically active 
peptides comprise agonistic or mimetic and antagonistic peptides as 
defined below. 

The terms "-mimetic peptide" and "-agonist peptide" refer to a 
2 5 peptide having biological activity comparable to a protein (e.g., EPO, TPO, 
G-CSF) that interacts with a protein of interest. These terms further 
include peptides that indirectly mimic the activity of a protein of interest, . 
such as by potentiating the effects of the natural ligand of the protein of 
interest; see, for example, the G-CSF-mimetic peptides listed in Tables 2 
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and 7. Thus, the term "EPO-mimetic peptide" comprises any peptides that 
can be identified or derived as described in Wrighton etaL (1996), Science 
273 : 458-63, Naranda etaL (1999), Proc. Natl. Ac ad. Sci. USA 96: 7569-74, 
or any other reference in Table 2 identified as having EPO-mimetic subject 
5 matter. Those of ordinary skill in the art appreciate that each of these 
references enables one to select different peptides than actually disclosed 
therein by following the disclosed procedures with different peptide 
libraries. 

The term "TPO-mimetic peptide" comprises peptides that can be 

1 0 identified or derived as described in Cwirla et al . (1997), Science 276: 1696- 
9 , U.S. Pat. Nos. 5,869,451 and 5,932,946 and any other reference in Table 2 
identifed as having TPO-mimetic subject matter, as well as the U.S. patent 
application, "Thrombopoietic Compounds," filed on even date herewith 
and hereby incorporated by reference. Those of ordinary skill in the art 

1 5 appreciate that each of these references enables one to select different 
peptides than actually disclosed therein by following the disclosed 
procedures with different peptide libraries. 

The term "G-CSF-mimetic peptide" comprises any peptides that 
can be identified or described in Paukovits etal. (1984), Hoppe-Sevlers Z. 

2 0 Phvsiol. Chem . 365: 303-11 or any of the references in Table 2 identified as 
having G-CSF-mimetic subject matter. Those of ordinary skill in the art 
appreciate that each of these references enables one to select different 
peptides than actually disclosed therein by following the disclosed 
procedures with different peptide libraries. 

2 5 The term "CTLA4-mimetic peptide" comprises any peptides that 

can be identified or derived as described in Fukumoto et al . (1998), Nature 
Biotech . 16: 267-70. Those of ordinary skill in the art appreeiate_that each of , 
these references enables one to select different peptides than actually 
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disclosed therein by following the disclosed procedures with different 
peptide libraries. 

The term "-antagonist peptide" or "inhibitor peptide" refers to a 
peptide that blocks or in some way interferes with the biological activity of 
5 the associated protein of interest, or has biological activity comparable to a 
known antagonist or inhibitor of the associated protein of interest. Thus, 
the term "TNF-antagonist peptide" comprises peptides that can be 
identified or derived as described in Takasaki etal. (1997), Nature Biotech . 
15: 1266-70 or any of the references in Table 2 identified as having TNF- 

1 0 antagonistic subject matter. Those of ordinary skill in the art appreciate 
that each of these references enables one to select different peptides than 
actually disclosed therein by following the disclosed procedures with 
different peptide libraries. 

The terms "IL-l antagonist" and "IL-lra-mimetic peptide" 

1 5 comprises peptides that inhibit or down-regulate activation of the IL-l 
receptor by IL-l. IL-l receptor activation results from formation of a 
complex among IL-l, IL-l receptor, and IL-l receptor accessory protein. 
IL-l antagonist or IL-lra-mimetic peptides bind to IL-l, IL-l receptor, or 
IL-l receptor accessory protein and obstruct complex formation among 

2 0 any two or three components of the complex. Exemplary IL-l antagonist 
or IL-lra-mimetic peptides can be identified or derived as described in 
U.S. Pat. Nos. 5,608,035, 5,786,331, 5,880,096, or any of the references in 
Table 2 identified as having IL-lra-mimetic or IL-l antagonistic subject 
matter. Those of ordinary skill in the art appreciate that each of these 

2 5 references enables one to select different peptides than actually disclosed 
therein by following the disclosed procedures with different peptide 
libraries. - . 

The term "VEGF-antagonist peptide" comprises peptides that can 
be identified or derived as described in Fairbrother (1998), Biochem. 37: 
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17754-64, and in any of the references in Table 2 identified as having 
VEGF-antagonistic subject matter. Those of ordinary skill in the art 
appreciate that each of these references enables one to select different 
peptides than actually disclosed therein by following the disclosed 
5 procedures with different peptide libraries. 

The term "MMP inhibitor peptide" comprises peptides that can be 
identified or derived as described in Koivunen (1999), Nature Biotech. 17: 
768-74 and in any of the references in Table 2 identified as having MMP 
inhibitory subject matter. Those of ordinary skill in the art appreciate that 

1 0 each of these references enables one to select different peptides than 
actually disclosed therein by following the disclosed procedures with 
different peptide libraries. 

Additionally, physiologically acceptable salts of the compounds of 
this invention are also encompassed herein. By "physiologically 

1 5 acceptable salts" is meant any salts that are known or later discovered to 
be pharmaceutically acceptable. Some specific examples are: acetate; 
trifluoroacetate; hydrohalides, such as hydrochloride and hydrobromide; 
sulfate; citrate; tartrate; glycolate; and oxalate. 
Structure of compounds 

2 o In General . In the compositions of matter prepared in accordance 

with this invention, the peptide may be attached to the vehicle through the 
peptide's N-terminus or C-terminus. Thus, the vehicle-peptide molecules 
of mis invention may be described by the following formula I: 
I 

25 (X 1 ) a -F 1 -(X 2 ) b 

wherein: 

F l is a vehicle (preferably an Fc domain); 

X 1 and X 2 are each independently selected from -(L l ) c -P l , -(LVP 1 - 
(L 2 ) d -P 2 , ^VPWV^LVP 3 , ^l-PW-^e -P 3 -(L 4 ) r P 4 



WO 00/24782 



PCTAJS99/2S044 



P\ P 2 , P 3 , and P 4 are each independently sequences of 
pharmacologically active peptides; 

L 1 , V, V, and L 4 are each independently linkers; and 
a, b, c, d, e, and f are each independently 0 or 1, provided that at 
5 least one of a and b is 1 . 

Thus, compound I comprises preferred compounds of the formulae 

n 

X 1 -F' 

and multimers thereof wherein F 1 is an Fc domain and is attached at the C- 
1 0 terminus of X 1 ; 

in 

f'-x 2 

and multimers thereof wherein F 1 is an Fc domain and is attached at the N- 
terminus of X 2 ; 
15 IV 

F , -(L , ) C -P 1 

and multimers thereof wherein F 1 is an Fc domain and is attached at the N- 

terminus of -(L'^-P 1 ; and 

V 

20 FMLVPMLVP 2 

and multimers thereof wherein F 1 is an Fc domain and is attached at the N- 

terminus of -L'-P'-L'-P 2 . 

Peptides . Any number of peptides may be used in conjunction with 
the present invention. Of particular interest are peptides that mimic the 
2 5 activity of EPO, TPO, growth hormone, G-CSF, GM-CSF, IL-lra, leptin, 
CTLA4, TRAIL, TGF-a, and TGF-p. Peptide antagonists arejilso of 
interest, particularly those antagonistic to the activity of TNF, leptin, any 
of the interleukins (IL-1, 2, 3, ...), and proteins involved in complement 
activation (e.g., C3b). Targeting peptides are also of interest, including 
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tumor-homing peptides, membrane-transporting peptides, and the like. 
All of these classes of peptides may be discovered by methods described in 
the references cited in this specification and other references. 

Phage display, in particular, is useful in generating peptides for use 
5 in the present invention. It has been stated that affinity selection from 
libraries of random peptides can be used to identify peptide ligands for 
any site of any gene product. Dedman etal. (1993), T. Biol. Chem . 268: 
23025-30. Phage display is particularly well suited for identifying peptides 
that bind to such proteins of interest as cell surface receptors or any 

1 0 proteins having linear epitopes. Wilson etal. (1998), Can. T. Microbiol. 44: 
313-29; Kay eLal. (1998), Drug Disc. Today 3: 370-8. Such proteins are 
extensively reviewed in Herz etal. (1997), T. Receptor & Signal 
Transduction Res . 17(5): 671-776, which is hereby incorporated by 
reference. Such proteins of interest are preferred for use in this invention. 

15 A particularly preferred group of peptides are those that bind to 

cytokine receptors. Cytokines have recently been classified according to 
their receptor code. See Inglot (1997), Archivum I mmunologiae et 
Therapiae Experimentalis 45: 353-7, which is hereby incorporated by 
reference. Among these receptors, most preferred are the CKRs (family I in 

2 0 Table 3). The receptor classification appears in Table 3. 
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Table 3— Cytokine Receptors Classified by Receptor Code 



Cytokines (ligands) 


Receptor Type 


family subfamily 


family subfamily 


1. Hematopoietic 1. IL-2, IL-4, IL-7, 
cytokines IL-9, IL-13, IL- 
15 

2. IL-3. IL-5, GM- 
CSF 

3. IL-6, IL-11.IL- 
12, LIF.OSM, 
CNTF, leptin 
(OB) 

4. G-CSF, EPO, 
TPO, PRL, GH 

5. IL-17, HVS-IL- 
17 


I. Cytokine R 1. shared yCr 
(CKR) 

2. shared GP 140 
PR 

3. 3.shared RP 
130 

4. "single chain" R 

5. other R e 


II. IL-10 ligands IL-10, BCRF-1, 
HSV-IL-10 


II. IL-10 R 


III. Interferons 1. IFN-al, a2, a4, 

m, t, IFN-p d 
2. IFN-y 


III. Interferon R 1. IFNAR 
2. IFNGR 


IV. IL-1 ligands 1. IL-1a, IL-1p, IL- 

1Ra 


IV. IL-1R 


V. TNF ligands 1. TNF-a, TNF-p 

(LT),FAS1, 
CD40 L, 
CD30L. CD27 L 


V. NGF/TNF R° 


VI. Chemokines 1. a chemokines: 

IL-8, GRO 
a, p,y, IF-10, 
PF-4, SDF-1 

2. p chemokines: 
MIP1a, MIP1p, 
MCP-1 ,2,3,4, 
RANTES, 
eotaxin 

3. y chemokines: 
lymphotactin 


VI. ChemokineR 1. CXCR 

2. CCR 

3. CR 

4. DARC* 



c IL-17R belongs to the CKR family but is not assigned to any of the 4 indicated subjamilies. 
d Other IFN type I subtypes remain unassigned. Hematopoietic cytokines, IL-10 ligands and 
interferons do not possess functional intrinsic protein kinases. The signaling molecules for the 
cytokines are JAK's, STATs and related non-receptor molecules. IL-14, IL-16 and IL-18 have been 
cloned but according to the receptor code they remain unassigned. 

6 TNF receptors use multiple, distinct intracellular molecules for signal transductionjncluding 
"death domain" of FAS R and 55 kDa TNF-aR that participates in their cytotoxic effects. NGF/TNF 
R can bind both NGF and related factors as well as TNF ligands. Chemokine receptors are G 
protein-coupled, seven transmembrane (7TM, serpentine) domain receptors. 
r The Duffy blood group antigen (DARC) is an erythrocyte receptor that can bind several different 
chemokines. It belongs to the immunoglobulin superfamily but characteristics of its signal 
transduction events remain unclear. 
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VII. Growth factors 

1.1 SCF,M-CSF, 
PDGF-AA, AB, 
BB, FLT-3L, 
VEGF, SSV- 
PDGF 

1.2 FGFa, FGFp 

1.3 EGF, TGF-a, 
VV-F19(EGF- 
like) 

1.4IGF-I, IGF-II, 
Insulin 

1.5 NGF, BDNF, 
NT-3, NT-4° 
2, TGF-B1,P2,g3 



VII. RKF 1. TK sub-family 

1.1 IgTKIIIR 



1.2 IgTKIVR 

1.3 Cysteine-rich 
TK-I 

1 .4 Cysteine rich 
TK-II 

1.5 Cysteine knot 
TK V 

2. STK subfamily' 



Exemplary peptides for this invention appear in Tables 4 through 
20 below. These peptides may be prepared by methods disclosed in the 
5 art. Single letter amino acid abbreviations are used. The X in these 

sequences (and throughout this specification, unless specified otherwise in 
a particular instance) means that any of the 20 naturally occurring amino 
acid residues may be present. Any of these peptides may be linked in 
tandem (i.e., sequentially), with or without linkers, and a few tandem- 

1 0 linked examples are provided in the table. Linkers are listed as "A" and 
may be any of the linkers described herein. Tandem repeats and linkers 
are shown separated by dashes for clarity. Any peptide containing a 
cysteinyl residue may be cross-linked with another Cys-containing 
peptide, either or both of which may be linked to a vehicle. A few cross- 

1 5 linked examples are provided in the table. Any peptide having more than 
one Cys residue may form an intrapeptide disulfide bond, as well; see, for 
example, EPO-mimetic peptides in Table 5. A few examples of 
intrapeptide disulfide-bonded peptides are specified in the table. Any of 
these peptides may be derivatized as described herein, and a few 

2 0 derivatized examples are provided in the table. Derivatized peptides in 



The neurotrophic cytokines can associate with NGF/TNF receptors also. 
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the tables are exemplary rather than limiting, as the associated 
underivatized peptides may be employed in this invention, as well. For 
derivatives in which the carboxyl terminus may be capped with an amino 
group, the capping amino group is shown as -NH^. For derivatives in 
5 which amino acid residues are substituted by moieties other than amino 
acid residues, the substitutions are denoted by a, which signifies any of 
the moieties described in Bhatnagar etal. (1996), T. Med. Chem . 39: 3814-9 
and Cuthbertson etal. (1997), T. Med. Chem . 40: 2876-82, which are 
incorporated by reference. The J substituent and the Z substituents (Z s , 

10 . . .Z J are as defined in U.S. Pat. Nos. 5,608,035 ,5,786,331, and 5,880,096, 
which are incorporated by reference. For the EPO-mimetic sequences 
(Table 5), the substituents X 2 through X n and the integer "n" are as defined 
in WO 96/40772, which is incorporated by reference. The substituents 
"0," and "+" are as defined in Sparks etal. (1996), Proc. Natl. Acad. Sci . 93: 

1 5 1540-4, which is hereby incorporated by reference. X 4 , and X 7 are as 

defined in U.S. Pat. No. 5,773,569, which is hereby incorporated by 
reference, except that: for integrin-binding peptides, X,, X 2 , X y X 4 , X^ X^ X 7 , 
and X 8 are as defined in International applications WO 95/14714, 
published June 1, 1995 and WO 97/08203, published March 6, 1997, which 

2 0 are also incorporated by reference; and for VIP-mimetic peptides, X„ X x \ 
X", X 2 , X 3 , X^ Xg, X 6 and Z and the integers m and n are as defined in WO 
97/40070, published October 30, 1997, which is also incorporated by 
reference. Xaa and Yaa below are as defined in WO 98/09985, published 
March 12, 1998, which is incorporated by reference. AA,, AA 2 , AB 17 AB 2 , 

2 5 and AC are as defined in International application WO 98/53842, 

published December 3, 1998, which is incorporated by reference. X 1 , X 2 , X 3 , 
and X 4 in Table 17 only are as defined in European application EP 0 911 

h STKS may encompass many other TGF-p-related factors that remain unassigned. The protein 
kinases are intrinsic part of the intracellular domain of receptor kinase family (RKF). The enzymes 
participate in the signals transmission via the receptors. 
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393, published April 28, 1999. Residues appearing in boldface are D- 
amino acids. All peptides are linked through peptide bonds unless 
otherwise noted. Abbreviations are listed at the end of this specification. In 
the "SEQ ID NO." column, "NR M means that no sequence listing is required 
5 for the given sequence. 



Table 4 — IL-1 antagonist peptide sequences 



sequence/ st rue cure 


SFO 
lu inu: 




212 


XXQZ,YZ fi XX 


907 


Z 7 XQZ^YZ fi XX 


908 


Z,Z fl QZ<YZ fi Z„Z 10 


909 


Z^QZ.YZ^ 


910 




917 


Z^NZ^Z^Z^Z^Z^Z^Z.. 


979 


TANVSSFEWTPYYWQPYALPL 


213 


SWTDYGYWQPYALPISGL 


214 


ETPFTWEESNAYYWQPYALPL 


215 


ENTYSPNWADSMYWQPYALPL 


216 


SVGEDHNrWTScYWQrYALrL 




DGYDRWRQSGERYWQPYALPL 


218 


FEWTPGYWQPY 


219 


FEWTPGYWQHY 


220 


FEWTPGWYQJY 


221 


AcFEWTPGWYQJY 


222 


FEWTPGWpYQJY 


223 


FAWTPGYWQJY 


224 


FEWAPGYWQJY 


225 


FEWVPGYWQJY 


226 


FEWTPGYWQJY 


227 


AcFEWTPGYWQJY 


228 


FEWTPaWYQJY 


229 


FEWTPSarWYQJY 


230 


FEWTPGYYQPY 


231 


FEWTPGWWQPY 


232 


FEWTPNYWQPY 


233 


FEWTPvYWQJY 


234 


FEWTPecGYWQJY 


235 


FEWTPAibYWQJY 


236 


FEWTSarGYWQJY 


237 


FEWTPGYWQPY 


238 


FEWTPGYWQHY 


239 


FEWTPGWYQJY 


240 
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AcFEWTPGWYQJY 


241 


FEWTPGW-pY-QJY 


242 


FAWTPGYWQJY 


243 


FEWAPGYWQJY 


244 


1 FEWVPGYWQJY 


245 


1 FEWTPGYWQJY 


246 


AcFEWTPGYWQJY 


247 


FEWTPAWYQJY 


248 


FEWTPSarWYQJY 


249 


FFWTPGYYQPY 


250 


FEWTPGWWQPY 


251 


FEWTPNYWQPY 


252 


FEWTPVYWQJY 


253 


FFWTPprGYWOJY 


254 


FFWTPAibYWOJY 


255 


FFWT^a rG YWOJ Y 


256 


FFWTPGYWOPYALPL 


257 


1 ManFWTPfiYYOJY 

1 1 NdfJCZ V V 1 i\J I I VjIO 1 


258 


VFWTPfiYYOJY 


259 


PP\A/\/Pr5 YYO. )Y 


260 


FFWTP^YYOJY 


261 


CFWTPNYYn IY 


262 


TKPR 


263 


RKQQK 


264 




265 




266 


Rk'onkR 


267 


FNIRKHnKRF 


268 


\rrvFYF 


269 


X/Tk'FY 


270 


vrnFY 


271 


^Hl YWOPYSVQ 


671 


Tl VYWOPYSLQT 


672 


RGDYWOPYSVOS 


673 


VHVYWOPYSVOT 


674 


RLVYWQPYSVQT 


675 


SRVWFQPYSLQS 


676 


M MVYWOP YS IOT 


677 


SWFWQPYSVQT 


678 


TFVYWQPYALPL 


679 


TLVYWQPYSIQR 


680 


RLVYWQPYSVQR 


681 


SPVFWQPYSIQI 


682 


WiEWWQPYSVQS 


683 


SLIYWQPYSLQM 


684 


TRLYWQPYSVQR 


■~ 685 


RCDYWQPYSVQT 


686 


MRVFWQPYSVQN 


687 


KIVYWQPYSVQT 


688 


RHLYWQPYSVQR 


689 
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ALVWWQPYSEQI 


MO 


SRVWFQPYSLQS 


O/l 


WEQPYALPLE 




QLVWWQPYSVQR 




DLRYWQPYSVQV 


MA 


ELVWWQPYSLQL 


WD 


DLVWWQPYSVQW 




NGNYWQPYSFQV 


AQ7 


ELVYWQPYSIQR 


070 


ELMYWQPYSVQE 




NLLYWQPYSMQD 


7flfl 


GYEWYQPYSVQR 


/Ul 


SRVWYQPYSVQR 


7n? 


LSEQYQPYSVQR 


/\jO 


GGGWWQPYSVQR 


7HA 


VGRWYQPYSVQR 


7fm 
/\JD 


VHVYWQPYSVQR 


/Uo 


QARWYQPYSVQR 


/U/ 


VHVYWQPYSVQT 


/Uo 


RSVYWQPYSVQR 


709 


TRVWFQPYSVQR 


710 


GRIWFQPYSVQR 


711 


GRVWFQPYSVQR 


712 


ARTWYQPYSVQR 


713 


ARVWWQPYSVQM 


714 


RLMFYQPYSVQR 


715 


ESMWYQPYSVQR 


716 


HFGWWQPYSVHM 


717 


ARFWWQPYSVQR 


71o 


RLVYWQ PYAPIY 


7iv 


RLVYWQ PYSYQT 


/zu 


RLVYWQ PYSLPI 


721 


RLVYWQ PYSVQA 


722 


SRVWYQ PYAKGL 


/Zo 


SRVWYQ PYAQGL 


/Z4 


SRVWYQ PYAMPL 


/JO 


SRVWYQ PYSVQA 


/Zo 


SRVWYQ PYSLGL 


707 

/Z/ 


SRVWYQ PYAREL 


/Zo 


SRVWYCJ rYonUr 


729 


SRVWYQ PYFVQP 


730 


EYEWYQ PYALPL 


731 


IPEYWQ PYALPL 


732 


SRIWWQ PYALPL 


733 


. DPLFWQ PYALPL 


734 


SROWVQ PYALPL 


735 


IRSWWQ PYALPL 


736 


RGYWQ PYALPL 


737 


RLLWVQ PYALPL 


738 


EYRWFQ PYALPL 


739 
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DAYWVO PYALPL 


740 


WSGYFQ PYALPL 


741 


NIEFWQ PYALPL 


742 


TRDWVQ PYALPL 


743 


DSSWYO PYALPL 


744 


IGNWYQ PYALPL 


745 


Ml RWDO PYALPL 


746 


1 PPRA/fl PYAI PI 


747 


HQV\AAA/0 PVAI Pi 
UoYWVVU rTMLrL 


748 


pcnwn pvai pi 


749 


ADQA/I f\ DVAI PI 

ArirWUJ rYMLrL 


750 


NoYrWU rYALrL 


751 


DCKilV\A/nDVC\/nP 

HrMYWUr YoVJJrt 


752 


AnLrWUrYoVUn 


753 


WWUrYALrL 


754 


\/\//^\r->\/ A 1 ni 

YYUrYALr'L 


755 


YrUrYALtiL 


756 


A/\A/Vr\D\/AI DI 

YWYQPYALrL 


/ o/ 


DVArtA/HDVATDI 

HVVVVUr YA 1 r L 


758 


OIA/VADVAI r~ 

(jiWYUrYALOir 


75Q 


YWYUrYALtaL 


760 


IWYUrYAMrL 


761 
/ox 


OKI* A/^DV/^DI O 


76? 


1 hVYWUrY AVtaLrAAt 1 AOIM 


76^ 


1 hVYWUrY oVUM 1 1 1 or\ V 1 M 


764 


TCVA/1A/ADV CCUVV\/DY/^CTDI 

TrVYWUPY ooriAAvr AorrL 


765 


TCVA/\A/nDV \//~\K |D/^\\A/ A IUWDL-1 

1 rVYWUrY YCaNrUWAInvrin 


766 


irVYWCJr J Y VLLtLrfcvaAVnA 


767 


IrVYWUrY VUYVVVrlrlALiV 


768 


oWYQPYVUijiVvH 


769 


nWbvJrYVKUvjVVo 


770 


fcWYQPYALCaWAK 


771 


PVAAA/ODVA Df^l 

CaWWUrYArioL 


772 


1 CCADVA 1/ A 1 ^1 

Lr tCJ r Y AlxALvji L 


773 


AlA/CADV A DAI An 

oWbUrYAnbLAb 


774 


A\A/\/nPVATPI nF 


775 


MWl Ur Y OOUrMC 


776 


^\ArrnPVQnnr5 c\/ 

VaVV IUr TOUUOCV 


777 


n\A/cr^DVCir^Qnp 


778 


PWIOPYARGFG 


779 


RPLYWQPYSVQV 


780 


TLIYWQPYSVQI 


781 


RFDYWQPYSDQT 


782 


WHQFVQPYALPL 


783 


EWDS VYWQPYSVQ TLLR 


784 


WEQN VYWQPYSVQ SFAD 


~ 785 


SDV VYWQPYSVQ SLEM 


786 


YYDG VYWQPYSVQ VMPA 


787 


SDIWYQ PYALPL 


788 


QRIWWQ PYALPL 


789 
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^RIWWO PYALPL 


790 


R9I YWO PYALPL 


791 


TIIWFQ PYALPL 


792 


WFTWYO PYALPL 


793 


^YDWFQ PYALPL 


794 


^RlWflO PYALPL 


795 


FIMFWQ PYALPL 


796 


DYVWQQ PYALPL 


797 


MDLLVO WYQPYALPL 


798 


GSKVIL WYQPYALPL 


799 


ROC5ANI WYQPYALPL 


800 


nnnDFP WYQPYALPL 


801 


<^OI FRT WYOPYALPL 


802 


PTWVRP WYHPYAI PI 
C I VV V nC VV T VJi T r\L.r 1— 


803 


I^^QTH WVHPYAI PI 


804 


1 HAPMM WVOPYAI PI 
LUMnlVIIN VV T VJi T nLr U 


805 


CDDCOl^ VA/VOPVAI PI 


806 


\/wrMs\Aio \a/vhpvai Pi 
VIMJrxWn VV JKJr TMi_ri_ 


807 


i DDun\/ \aa/hpvai pi 
LHKrlUV WYUrYMLrL 


808 


noTACI IA/VODVAI PI 

Rol Aol WYUrYALrL 


809 


CPI/cnA \A/VADVAI PI 

cSKcUCj WYUrYALrL 


810 


COI "TAJIL/ \AA/ADVA 1 PI 

fcoLIMK. WYUrYALrL 


811 


toonbo WYUrYALrL 


812 


VIcWWCJ rYALrL 


813 


\/\A/N/\A/C/"N DN/AI PI 

VWYWtVJ rYALrL 


814 


AOC1AAA/A DN/AI Dl 

AStWWCJ rYALrL 


815 


CVC\AAA/A DVAI PI 

FYcWWU rYALrL 


816 


CA\AAA/\/A DN/AI PI 

bbWWVU rYALrL 


817 


%A//"*E\A/I /~\ DN/AI Dl 

WCacWLU rYALrL 


818 


nv\AA/cr» dvai PI 
UYVWcCJ rYALrL 


819 


A UT\AAA/n PVAI PI 
AM 1 WVVU r TnLrL 


820 


CIOA/CA DN/AI Dl 

MfcWrvJ rYALrL 


821 


IA/1 AlA/Cn PVAI PI 

WLAWtU rTMLrL 


822 


A/MIPXAAA/O PVAI PI 
VWIcWWU rYALrL 


823 


CRM\A/n PVAI PI 
cnivivvvj rinLrL 


824 


MYYVA/YY PVAI PI 
INaAWAA rTMLrL 


825 


\A/^NHA/VH PVAI PI 
WolNJWTLJ r YALrL 


826 


Tl VlA/FO PYAI PI 
I L Y VV XZ\Jt r T r\i_r i_ 


827 


\AA/P\A/PO PVAI PI 
VWnWtU rTnLrL 


828 


LLWTQ PYALPL 


829 


SRIWXX PYALPL 


830 


SDIWYQ PYALPL 


831 


WGYYXX PYALPL 


832 


TSGWYQ PYALPL 


833 


VHPYXX PYALPL 


834 


EHSYFQ PYALPL 


~ 835 


XXIWYQ PYALPL 


836 


AQLHSQ PYALPL 


837 


WANWFQ PYALPL 


838 


SRLYSQ PYALPL 


839 
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GVTFSQ PYALPL 


840 


SIVWSQ PYALPL 


841 


SRDLVQ PYALPL 


842 


HWGH VYWQPYSVQ DDLG 


843 


SWHS VYWQPYSVQ SVPE 


844 


WRDS VYWQPYSVQ PESA 

Till V 1 V v N^C 1 1 * %M t ^M^^r » 


845 


TWDA VYWQPYSVQ KWLD 


846 


TPPW VYWQPYSVQ SLDP 


847 


v\A/qs VYWQPYSVQ SVHS 

1 WOO V 1 VVVXI 1 O » vj< <J ¥ I 


848 


YWY OPY ALGL 


849 


YWY OPY Al PI 

T VV T Wr 1 nuL 


850 


FWI HPY ATOL 

C.VVI vJi T r\ 1 VJL 


851 


NWF OPY AKPL 

IM VV C» V»m 1 T nr\rL 


852 


AFY OPY Al PI 


853 


Fl Y OPY ALPL 

1 L.T VJr 1 l_ 


854 


wpi/ OPY 1 FWP. 


855 


FTPFTWFF^NIAYYWOPYALPL 


856 


nnwi TWO D^V DM YWO PYALPL 

VJVJ> VV l_ 1 V V VVL/O V LSI VI 1 V V Vjc i 1 I— 


857 


F^FA^YTWPFNTYWO PYALPL 


858 


TFQPOOI DWAKI YWO PYALPL 

1 cor vi\3i-L/VVr\r\i i vv\jjr 1 hli l_ 


859 


nnvnR wro^of rywopyalpl 


860 


TANV^^FFWTPfi YWO PYALPL 


861 


^VrtFnHMFWT^F YWOPYALPL 

O VOuLfnlNr VV IOC I VVVrfi 1 r\Lr L. 


862 


MMnOTQPV/QTFP YWOPYAI PI 


863 


CU/CPACPriPRMI VWOPYAI PI 


864 


AVACDOAI kinW^l VWOPYAI PI 


865 


M^niA/ATAniA/QMY VWOPYAI PI 
NvnUVVA I MUWolN T Y VVVJr TrtL.rl- 


866 


TLJrvPLM VWOPVAI PI 
1 riUtnl YVVUrTnLrL 


867 


mi cinvnwTPrs vwopyai pi 

MLtl\ IYI 1 W 1 r VJ T VVUrTMLrL 


868 


i u/onpl TPnAHl VWOPYAI PI 


869 


cnArrrronciOAM VWOPYAI PI 


870 


^nnAAWPTnQi T VWOPYAI PI 


871 


AIIDOI VHWQFM YWOPYAI PI 


872 


FMTVQPMWAn<5M YWOPYAI PL 
CIN 1 Y oi ImWMUOIVI T VVVju inLrU 


873 


IWtNnOT^FV^TFP YWOPYALPL 

Ivl IN LJ\j{ 1 Ot-VO 1 1 1 I V v w r I rVL.1 L- 


874 


^VfSFnHNFWT^F YWOPYALPL 


875 


OTPFTWFFSNAY YWQPYALPL 


876 


FNIPFTWOF^IMAY YWOPYALPL 


877 


A/TPFTWFD^NIVF YWOPYALPL 


878 


QIPFTWEQSNAY YWQPYALPL 


879 


QAPLTWQESAAY YWQPYALPL 


880 


EPTFTWEESKAT YWQPYALPL 


881 


TTTLTWEESNAY YWQPYALPL 


882 


ESPLTWEESSAL YWQPYALPL 


883 


ETP LTWEES NAY YWQPYALPL 


884 


EATFTWAESNAY YWQPYALPL 


"~ 885 


EALFTWKESTAY YWQPYALPL 


886 


STP-TWEESNAY YWQPYALPL 


887 


ETPFTWEESNAY YWQPYALPL 


888 


KAPFTWEESQAY YWQPYALPL 


889 
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STSFTWEESNAY YWQPYALPL 


890 


DSTFTWEESNAY YWQPYALPL 


891 


Yl PFTWEESNAY YWQPYALPL 


892 


QTAFTWEESNAY YWQPYALPL 


893 


FTI FTWEESNAT YWQPYALPL 


894 


V^FTWEESNAY YWQPYALPL 


895 


OPYAl PL 


896 


Pv-1 -NanPYOJYALPL 


897 


TANV^FEWTPG YWQPYALPL 


898 


FFWTPfiYWOPYALPL 


899 


FFWTPfiYWO JYALPL 


900 


FP\A/TP(^YYn IYAI PL 

"CVV 1 \\J T I UU I AAl-l I— 


901 


CTDCT\A/CPQMAW\A/nPYAI PI 


902 


CT\A/PCQMAVY\A/n IVAI PI 
r 1 WttolNMT TVVVJJTMLrL 


903 


An\/| NAA/HPVA P\/TI WW 

AUVL YVVUr YM r V I LW v 


904 


/\n\ / A C V\A/HDVA 1 Dl TCI 

CaUVAb YWUrYA LrLloL 


905 


SWTDYG YWUrYA LrlooL 




FEWTPGYWUrYALrL 


91 1 

711 


FEWTPGYWQJYALrL 




FE WTPG WYQ P Y ALP L 




r-i-»*prn^lAA/r\ ivy a 1 Dl 

FEWTPGWYQJYALPL 


✓ i*t 


FEWTPGYYQPYALr'L 




FEWTPGYYQJYALrL 


916 


TAM\/COCC\A/TD<^V\A/nDVAI PI 

TANVoorbW 1 roYWUrYALrL 


918 


SWI DYGYWUr'YALrlovaL 


919 


CTD CT\A/C C CM A W\A/HP V A 1 PI 

1 1 rr 1 WttoiNAY TWUrYMLrL 


920 


CMTVCDMXA/AnQMVXA/OPVAl PI 


921 


OV/ACnUMnA/TQPYWnPYAI PI 
OVbtUnlNrVV 1 OCTVVUr TALrU 


922 


npvnD\A/nnQ^PPV\A/nPYAI PI 


923 


CP\A/TP<^V\A/nPYA 1 PI 
rtW 1 rVJ Y VVVJr I MLrL 


924 


CC\A/TD^V\A/nPV 
rtWI i Y VVvJr T 


925 


rtW 1 r\j Y VV\JJ T 


926 


C\A/TD/^V\A/nDV 
LW 1 r ijl Y VVVJr Y 


927 


CJT\A/TDf2\A/VO IY 


928 


At W 1 r O Y VV^u Y 


929 


i C A\A/T*D/^V\A/Pl IV 

rAW 1 r\j Y wljj Y 


930 


CCA TD/^VIA/H IV 

: rfcA 1 r La Y WvJJ Y 


931 


CCWV/APl^VM/O IV 
rtWArb Y WUJ Y 


932 


CC\A/TAPV\A/n IV 

rhW 1 AoYVvaJJ Y 


933 


FEWTPAYWQJY 


934 


FEWTPGAWQJY 


935 


FEWTPGYAQJY 


936 


FEWTPGYWQJA 


937 


FEWTGGYWQJY 


938 


FEWTPGYWQJY 


939 


FEWTJGYWQJY 


940 


FEWTPecGYWQJY 


941 


FEWTP AibYWQJY 


942 


FEWTPSarWYQJY 


943 


FEWTSarGYWQJY 


944 
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FEWTPNYWQJY 


945 


FEWTPVYWQJY 


946 


FEWTVPYWQJY 


947 


AriFFWTPGWYQJY 


948 


ArFFWTPGYWOJY 


949 


INan-FWTPGYYQJY 


950 


YFWTPGYYQJY 


951 


FFVAA/PfiYYOJY 


952 


FFWTPfiYYOJY 


953 


FFWTP^YYOJY 


954 


F F WTP n Y YO. J Y 


955 


c*HI Y-Nan-OPYSVQM 


956 


Tl VY-Nan-OPYSLQT 


957 


RGDY-Nan-QPYSVQS 


958 


NMVY-Nan-GPYSIQT 


959 


VYWOPY^VO 

V T VVvjcr T O VM 


960 


V Y- Nl a n-O P YP5VO 


961 


TFV/YWO IYAI PI 


962 


FF\A/TPf5 YYO J- Rn?i 


963 


YQQpPWTPnwn l-Rna 

• AddrCVV 1 ru T T VJO DfJa 


964 


FFVA/TPH Y-Rna-O. IY 


965 


A^CP\A/TPfSV-Rna-0 IY 


966 


crc\Arrp^5 Rna.Vn IY 
rtVv I ru Dpa* T VJU T 


967 


ACrfcW 1 rta-Dpa-YvJJT 


968 


ACrb-bpa- 1 ruiTUJY 


969 


ACrE-Dpa- 1 rtiYYUJY 


970 


Dpa-fcW 1 roYYUJT 


971 


ACbpa-tW 1 rbYYUJi 


972 


\ f\s\Ai/-\ Dvc\/n 

VYWUrYoVU 


973 


Dl \/V)A/nDVC\/r\D 

KLVYWU"YoV<Jn 


974 


ni \/v Mor* nPVQ\/HR 

riLV Y-fMap-vjn Y ovvjn 


975 


qi nviA/nPVQVOR 


976 


nLVVVrUr Tovun 


977 


QI VMA/nPY^IOR 


978 


nMQQWVn^FI 1 
UINOOVV T UOrLL 


980 


nMTAWYF^FI A 


981 


nMTAWYFNFLL 


982 


PARC nNTAWYDSFLI WC 


983 


tc;fy DNITTWYFKFLA SQ 


984 


SQIP DNTAWYQSFLL HG 


985 


SPFI DNTAWYENFLLTY 


986 


EQIY DNTAWYDHFLL SY 


987 


TPFI DNTAWYENFLLTY 


988 


TYTY DNTAWYERFLM SY 


989 


TMTQ DNTAWYENFLL SY 


990 


Tl DNTAWYANLVQ TYPQ 


"~ 991 


Tl DNTAWYERFLA QYPD - 


992 


HI DNTAWYENFLL TYTP 


993 


SQ DNTAWYENFLL SYKA 


994 


QI DNTAWYERFLL QYNA 


995 
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NQ DNTAWYESFLL QYNT 


996 


Tl DNTAWYENFLL NHNL 


997 


HY DNTAWYERFLQ QGWH 


998 


ETPFTWEESNAYYWQPYALPL 


999 


YIPFTWEESNAYYWQPYALPL 


1000 


DGYDRWRQSGERYWQPYALPL 


1001 


pY-INap-pY-QJYALPL 


1002 


TANVSSFEWTPGYWQPYALPL 


1003 


FEWTPGYWQJYALPL 


1004 


FEWTPGYWQPYALPLSD 


1005 


FEWTPGYYQJYALPL 


1006 


FEWTPG YWQJ Y 


1007 


ArFF WTPG YWQJY 

AvlL.iV 1 1 v^l 1 vv\«U • 


1008 


ArFEWTPGWYQJY 


1009 


AcFEWTPGYYQJY 


1010 


ArFFWTPaYWQJY 


1011 


ArFFWTPflWYOJY 


1012 


AcFEWTPaYYQJY 


1013 


FEWTPGYYQJYALPL 


1014 


FEWTPGYWQJYALPL 


1015 


FEWTPGWYQJYALPL 


1016 


TANVSSFEWTPGYWQPYALPL 


1017 


AcFEWTPGYWQJY 


1018 


AcFEWTPGWYQJY 


1019 


AcFEWTPGYYQJY 


1020 


AcFEWTPAYWQJY 


1021 


AcFEWTPAWYQJY 


1022 


AcFEWTPAYYQJY 


1023 



31 



WO 00/24782 



PCT7US99/25044 



Table 5 — EPO-mimetic peptide sequences 





SEQ 
ID NO 


YXCXXGPXTWXCXP 


OJ 


YXCXXGPXTWXCXP-YXCXXGPXTWXCXP 


84 


YXCXXGPXTWXCXP-A-YXCXXGPXTWXCXP 


85 


YXCXXGPXTWXCXP-A- eam . ne) 


86 


\ 

IS 




6A 

YXCXXGPXTWXCXP-A- (a-amine) 


OA 
OO 


GGTYSCHFGPLTWVCKPQGG 


87 


GGDYHCRMGPLTWVCKPLGG 


88 


GGVYACRMGPITWVCSPLGG 


89 


VGNYMCHFGPITWVCRPGGG 


90 


GGLYLCRFGPVTWDCGYKGG 


91 


GGTYSCHFGPLTWVCKPQGG- 

r'^TvcrucrtDi T\A/\/r N k , Por^r^ 
(jo 1 YoOMrbrL 1 WVOrSrvjvjio 




GGTYSCHFGPLTWVCKPQGG -A- 
GGTYSCHFGPLTWVCKPQGG 


93 


GGTYSCHFGPLTWVCKPQGGSSK 


94 


GGTYSCHFG PLTWVCKPQGGSSK- 
GGTYSCHFG PLTWVCKPQGGSSK 


95 


GGTYSCHFGPLTWVCKPQGGSSK-A- 
GGTYSCHFGPLTWVCKPQGGSSK 


96 


GGTYSCHFG PLTWVCKPQGGSS. 


97 


/ 

.PA 

GGTYSCHFG P LTW VCKPQGGSS (a-amine) 


97 




GGTYSCHFGPLTWVCKPQGGSSK(-A-biotin) 


98 


CXXGPX K TWX,C 


421 


GGTYSCHGPLTWVCKPQGG 


422 


VGNYMAHMGPITWVCRPGG 


423 


GGPHHVYACRMGPLTWIC 


424 


GGTYSCHFGPLTWVCKPQ 


425 


GGLYACHMGPMTWVCQPLRG 


_ 426 


TIAQYICYMGPETWECRPSPKA 


427 


YSCHFGPLTWVCK 


428 


YCHFGPLTWVC 


429 


X„XXGPXJWX,X,, 


124 


YX„X„X,X<GPX;rWX,X„ 


461 



28- 
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y vyyy x gpx.taaoocxxjc 


419 


Y YY CY X_GPX.TWX,CX„X^X.. 


420 


GGLYLCRFGPVTWDCGYKGG 


1024 


GGTYSCHFGPLTWVCKPQGG 


1025 


GGDYHCRMGPLTWVCKPLGG 


1026 


VGNYMCHFGPITWVCRPGGG 


1029 


GGVYACRMGPITWVCSPLGG 


1030 


VGNYMAHMGPiTWVCRPGG 


1035 


GGTYSCHFGPLTWVCKPQ 


1036 


GGLYACHMGPMTWVCQPLRG 


1037 


TIAQYICYMGPETWECRPSPKA 


1038 


YSCHFGPLTWVCK 


1039 


YCHFGPLTWVC 


1040 


SCHFGPLTWVCK 


1041 


(AX,)XXXGPXJWXX 


1042 
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Table 6— TPO-mimetic peptide sequences 



Qo/iiionrp/cfnirhifP 


SEQ 
ID NO: 


IEGPTLRQWLAARA 


13 


1 EGPTLRQWLAAKA 


24 


IEGPTLREWLAARA 


2d 


IEGPTLRQWLAARA-A-IEGPTLRQWLAARA 


26 


IEGPTLRQWLAAKA-A-IEGPTLRQWLAAKA 


27 


IEGPTLRQCLAARA-A-IEGPTLRQCLAARA 
I 1 


28 


IEGPTLRQWLAARA-A-K(BrAc)-A-IEGPTLRQWLAARA 


29 


IFGPTl ROWUVARA-A-K(PEG)-A-IEGPTLRQWLAARA 


30 


ipi^pti Ron A AR A- A -IEGPTLRQWLAARA 

1 


31 


1 

IPHPTI ROOI AARA-A-IEGPTLRQWLAARA 


31 


IEGPTLRQWLAARA-A-IEGPTLRQCLAARA 


32 


I 

IEGPTLRQWLAARA-A-IEGPTLRQCLAARA 


32 


VRDQIXXXL 


33 


TLREWL 


34 


GRVRDQVAGW 


35 


GRVKDQIAQL 


36 


GVRDQVSWAL 


37 


ESVREQVMKY 


38 


SVRSQISASL 


39 


GVRETVYRHM 


40 


GVREVIVMHML 


41 


GRVRDQIWAAL 


42 


AGVRDQILIWL 


43 


GRVRDQIMLSL 


44 


GRVRDQI(X),L 


45 


CTLRQWLQGC 


46 


CTLQEFLEGC 


47 


CTRTEWLHGC 


48 


CTLREWLHGGFC 


49 


CTLREWVFAGLC 


50 


CTLRQWLI LLGMC 


51 


CTLAEFLASGVEQC 


52 


CSLQEFLSHGGYVC 


53 


CTLREFLDPTTAVC 


54 , 


CTLKEWLVSHEVWC 


55 


CTLREWL(X),X 


56-60 


REGPTLRQWM 


61 


EGPTLRQWLA 


62 


ERGPFWAKAC 


63 


REGPRCVMWM 


64 


CGTEGPTLSTWLDC 


65 



^0 
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C EQ DG PTLLE WLKC 


OO 


CELVGPSLMSWLTC 


O/ 


C LTG PFVTQWL YEC 




PDA^lDTI 1 P\A/f T1 P 
UnAbr 1 LLCVVL. 1 UO 


69 


CADGPTLREWISFC 


70 


C(X), ,EGPTLREWL(X),.,C 


71-74 


GGCTLREWLHGGFCGG 


75 


GGCADGPTLREWISFCGG 


76 


GNADGPTLRQWLEGRRPKN 


77 


LAIEGPTLRQWLHGNGRDT 


78 


HGR VG PTLREWKTQ VATKK 


79 


TIKGPTLRQWLKSREHTS 


80 


ISDGPTLKEWLSVTRGAS 


81 


SIEGPTLREWLTSRTPHS 


82 
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Table 7— G-CSF-mimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


EEDCK 


99 




99 


1 

EEDCK 


99 


EEDaK 


100 


EEDaK 


100 


1 

EEDaK 


100 


dGIuEDctK 


101 


pGluEDaK 
1 


101 


PGluEDaK 


101 


PicSDaK 


102 


PicSDaK 


102 


1 

PicSDaK 


102 


EEDCK-A-EEDCK 


103 


EEDXK-A-EEDXK 


104 
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Table 8 — TNF-antagonist peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


YCFTASENHCY 


106 


YCFTNSENHCY 


107 


YCFTRSEN HCY 


108 


FCASENHCY 


109 


YCASENHCY 


110 


FCNSENHCY 


111 


FCNSENRCY 


112 


FCNSVENRCY 


113 


YCSQSVSNDCF 


1 14 


FCVSNDRCY 


115 


YCRKELGQVCY 


116 


YHKFPGQCY 

1 V_/l\» — 1 VJV-»cV-» I 


117 


YCRKEMGCY 


118 


FCRKEMGCY 


119 


YCWSQNLCY 


120 


YCELSQYLCY 


121 


YCWSQNYCY 


122 


YCWSQYLCY 


123 


DFLPHYKNTSLGHRP 


1085 


AA,-AB, 

\ 

AC 

/ 

AA,-AB, 


NR 





WO 00/24782 



PCT/US99/25044 



Table 9— Integrin-binding peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


RX«ETX,WX, 


441 


RX,ETX,WX„ 


442 


RGDGX 


443 


CRGDGXC 


444 


CX^RLDXXC 


445 


CARRLDAPC 


446 


CPSRLDSPC 


447 


X XJCRGDXXX. 


448 




449 




450 


HDCRGDCLC 


451 


CLCRGDCIC 


452 


X X DDX X.XJC 


453 


X XJCDDX X.X,X,X Q 


454 


nWDDGWLC 


455 


HWDDLWWLC 


456 


, CWDDGLMC 


457 


GWDDGWMC 


458 


CSWDDGWLC 


459 


nPDDLWWLC 


460 


NGR 


NR 


GSL 


NR 


RGD 


NR 


CGRECPRLCQSSC 


1071 


GNGROVSGCAGRC 


1072 


CA SGSLSC 


1073 


RGD 


NR 


NGR 


NR 


G^l 


NR 


NGRAHA 


1074 


CNGRC 


1075 


CDCRGDCFC 


1076 


CGSLVRC 


1077 


DLXXL 


1043 


RTDLDSLRTYTL 


1044 


RTDLDSLRTY 


1053 


RTDLDSLRT 


1054 


RTDLDSLR 


1078 


GDLDLLKLRLTL 


1079 


GDLHSLRQLLSR 


1080 


RDDLHMLRLQLW 


1081 


SSDLHALKKRYG 


1082 


RGDLKQLSELTW 


1083 


RGDLAALSAPPV 


1084 



Ml 
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Table 10 — Selectin antagonist peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


DITWDQLWDLMK 


147 


DITWDELWKIMN 


148 


, DYTWFELWDMMQ 


149 


QITWAQLWNMMK 


150 


DMTWHDLWTLMS 


151 


DYSWHDLWEMMS 


152 


EiTWDQLWEVMN 


153 


HVSWEQLWDIMN 


154 


HITWDQLWRIMT 


155 


RNMSWLELWEHMK 


156 


AEWTWDQLWHVMNPAESQ 


157 


HRAEWLALWEQMSP 


158 


KKEDWLALWRIMSV 


159 


ITWDQLWDLMK 


160 


DITWDQLWDLMK 


161 


DITWDQLWDLMK 


162 


DITWDQLWDLMK 


163 


CQNRYTDLVAIQNKNE 


462 


AENWADNEPNNKRNNED 


463 


RKNNKTWTWVGTKKALTNE 


464 


KKALTNEAENWAD 


465 


CQXRYTDLVAIQNKXE 


466 


RKXNXXWTWVGTXKXLTEE 


467 


AENWADGEPNNKXNXED 


468 


CXXXYTXLVAIQNKXE 


469 


RKXXXXWXWVGTXKXLTXE 


470 


AXNWXXXEPNNXXXED 


471 


XKXKTXEAXNWXX 


472 
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Table 11 — Antipathogenic peptide sequences 



Sequence/structure 


SEQ 


ID NO: 


GFFALIPKIISSPLFKTLLSAVGSALSSSGGQQ 


503 


GFFALIPKIISSPLFKTLLSAVGSALSSSGGQE 


504 


GFFALIPKIISSPLFKTLLSAV 


505 


nFFAl IPK1ISSPLFKTLLSAV 


506 


H^FFAI IPKIISSPLFKTLLSAV 


507 


KK^FFAI IPKIISSPLFKTLLSAV 


508 


KK^FFAI IPKIISSPLFKTLLSAV 


509 


(nFFAl IPKIIS 


510 


nmAVLKVLTTGLPALISWIKRKRQQ 


511 


nifiAVLKVLTTGLPALISWIKRKRQQ 


512 


r^iriAVi HVI TTGLPALISW1KRKRGQ 


513 


rsif^AVI HVI TTGLPALISWIKR 


514 


AX/1 HVI TTf^l PAI I^WIKR 


515 


u'l i i i i hi i i i h 


516 


l/i i i wi i i k'l 1 k' 


517 


HI 1 1 HI HI HI 1 H 
W LL L l\ l_l\ Ur\ Ll_l\ 


518 


HHI 1 WI 111 HI Hk 


519 


HI 1 1 HI 1 1 HI 1 H 


520 


HI 1 1 HI HI HI 1 H 


521 


HI 1 1 1 H 


522 


KLLLKLLIS 


523 


KLLLr\LI\Lr\LLi\ 


524 




525 


iy 1 ■ i i/i ixi I/I l l/' 


526 


is a * AI/AA AI/AAW 


527 


ix\ a/\ /l/\/\/\/H\/\/H 


528 


H\A/\/HV/H\/H\/\/H 
l\V V Vf\VI\.Vr\V Vl\ 


529 


H\AA/H\/H\/HVH 
l\V V VI\VI\VI\Vr\ 


530 


H\A/\/H\/H\/H\/\/H 


531 


HI II HI 
r\l_ILr\L_ 


532 


l/A/l Ull 1 


533 


1 1/1 PI 1 


534 


HPI Ml 1 


535 


KLILKLVR 


536 


KVFHLLHL 


537 


HKFRILKL 


538 


KPFHILHL 


539 


KIIIKIKIKIIK 


540 


KIIIKIKIKUK 


541 


KIIIKIKIKIIK 


542 


KIPIKIKIKIPK 


543 


KIPIKIKIKIVK 


544 " 


RIIIRIRIRIIR 


545 


RIIIRIRIRIIR 


546 


RIIIRIRIRIIR 


547 


RIVIRIRIRLIR 


548 



% 
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RIIVRIRLRIIR 


549 


RM^IRl RVRIIR 


550 


KIVIRIRIRI IR 


551 


r\ tr\ v r\vv r»i_ni ir\ 


552 


WlfSWKl RX/RIIR 
VV l\Lri v ni i n 


553 


l\r\l \a vv lh n v n n 


554 


0|\/IDIDIQI IPIP 


555 


QIIWBIDI DIIDV/R 
rill VnlnLrlHnVn 


556 


HIGIHLnVMurlnV 


557 


l/IV/IDID A Dl IDIDID 

KIVIHInAnLlnlnlrt 


558 


DIIV/1/IDI DIII/t/tDI 

RIIVKIHLHIiKAIHL 


559 


KIGIKARVHNHVISII 


560 


r>ll\ /LJIDI D 1 ILJLJIDI 

RIIVHIHLHIIrlnlHL 


561 


l_l 1 1 %/ A LJ\/Q 1 1 QWLJ 1 1 


562 


niw /L/ILJf DN/ll/'l/'l D 1 

RIYVr\lnLKYIf\l\IKL 


563 


KIGnKARVnllriYK.ll 


564 


RIYVKPHPRYIKKIHL 


565 


KPGHKARPHIIHYKIl 


566 


i/i\/idididi intni ni/|\/ 

KIVIRIRIRLIRIRIRKIV 


567 


RilVKiRLRIIKKIHLIKK 


568 


KIGWKLRVRIIRVKIGRLH 


569 


iv i\ /tninmi imniDL/lwl/\/l/DID 

KIVIRIRIRLIRIRIRKIVKVKHIR 


570 


RFAVKIRLRIIKKIRLIKKIHKHVIK 


571 


I/a/ma/I/I n\/niiD\/i/ir*DI Dl/'IfMA/U'L^DV/Dlll' 

KAGWKLRVRIIRVKIGRLHKlGWKKnvnlK 




ni\/t/i/ni innwil/l/IDI 

RIYVKPHPRYIKKIRL 


57^ 


KPGHKARPHIIRYKII 


574 


\s\\ /ininini mmini/i\/ 

KIVIRIRIRLIRIRIRKIV 


575 


nm/i/ini nni/i/iDI 11/1/ 

RIIVKIRLRIIKKIRLIKK 


576 


RIYVSKISIYIKKIRL 


577 


KIVIFTRIRLTS1RIRSIV 


578 


KPIHKARPTIIRYKMI 


579 


cyclicCKGFFALIPKIISSPLFKTLLSAVC 


580 


CKKGFFALIPKIISSPLFKTLLSAVC 


581 


CKKKGFFALIPKIISSPLFKTLLSAVC 


582 


CyclicCRIVIRIRlRLIRIRC 


583 


CyclicCKPGHKARPHIIRYKIIC 


584 


CyclicCRFAVKIRLRIIKKIRLIKKIRKRVIKC 


585 


KLLLKLLL KLLKC 


586 


KLLLKLLLKLLK 


587 


KLLLKLKLKLLKC 


588 


KLLLKLLLKLLK 


589 



V 
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Table 12 — VIP-mimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


H^DAVFYDNYTR LRKQMAVKKYLN SILN 


590 


NIp H^DAVFYDNYTR LRKQMAVKKYLN SILN 


591 


v y • v « v 


592 


v o Y I M 


593 


row rw rn kkyy^ NH OH CO X6 

INri Ori vU r\r\T/\0 in n on v^v-/ au 


594 


1 1 
fCH2^m Z (CH2)n 




KKYL 


595 


NS1LN 


596 


KKYL 


597 




598 


AVKKYl 


599 


INOIL.IN 


600 


r\r\ t v 


601 


oil oiiM 

OILaUIN 


602 


r\r\ t uiNit; 


603 


MQVI Nl 

INO T LIN 


604 


rMol T IN 


605 


r\ ft, Y L r r IN o 1 U 1 N 


606 


LaUr\l\YL 


607 




608 


ftYL 


NR 


i\r\Yi\iie 


609 


Vftft YL 


610 


LINolLIN 


611 


VI MQII M 
T LIN Ol LIN 


612 


r\r\ T LIN 


613 


r\r\ T LINO 


614 


r\r\ t linoi 


615 


KkYI NSII 


616 


KKYL 


617 


KKYDA 


618 


AVKKYL 


619 


NSILN 


620 


KKYV 


621 


SILauN 


622 


NSYLN 


623 


NSIYN 


624 


KKYLNle 


625 


KKYLPPNS1LN 


626 


KKYL 


627 


KKYDA 


^628- 


AVKKYL 


629 


NSILN 


630 


KKYV 


631 


SILauN 


632 



1/ 
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1 ankWI 


633 


Oaprxrv T L M 


634 


KYL _ - 


NR 


r\YL _ 


NR 


r\l\YiNie 


635 


VKKYL 


636 


LNSILN 


637 


YLNSILN 




KKYLNIe 




KKYLN 


otu 


KKYLNS 




KKYLNSI 




KKYLNSIL 




KKKYLD 


A/14 


cyclicCKKYLC 




CKKYLK 
i i 


AAA 


i i 

S-CH^-CO 




KKYA 


647 


WWTDTGLvv 


648 


WWTDDGLW 


649 


iin>mTn/M \ A l\ MA f 1 1 

WWDTRG LW VWT 1 


650 


FWGNDGIWLESG 


651 


DWDQFGLWRGAA 


652 


RWDDNGLWVVVL 


653 


SG M WSHYG 1 WMG 


654 


GGRWDQAGLWVA 


655 


KLWSEQGIWMGh 


656 


CWSMHGLWLC 


657 


GCWDNTG1WVPC 


658 


D WDTRG LWVY 


659 


fil wnPNGAWI 


660 


KWDDRGLWMH 


661 


QAWNERGLWT 


662 


Q WDTRG LWVA 


663 


WNVHGIWQE 


664 


SWDTRGLWVE 


665 


DWDTRGLWVA 


666 


SWGRDGLWIE 


667 


EWTDNGLWAL 


668 


SWDEKGLWSA 


669 


SWDSSGLWMD 


670 



*/9 
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Table 13— Mdm/hdm antagonist peptide sequences 



Sequence/structure 


ID NO: 


TFSDLW 


130 


QETFSDLWKLLP 


131 


QPTFSDLWKLLP 


132 


QETFSDYWKLLP 


133 


QPTFSDYWKLLP 


134 


MPRFMDYWEGLN 


135 


VQNFIDYWTQQF 


136 


TGPAFTHYWATF 


137 


IDRAPTFRDHWFALV 


138 


PRPALVFADYWETLY 


139 


PAFSRFWSDLSAGAH 


140 


PAFSRFWSKLSAGAH 


141 


PXFXDYWXXL 


142 


QETFSDLWKLLP 


143 


QPTFSDLWKLLP 


144 


QETFSDYWKLLP 


145 


nPTPRDYWKLLP 


146 


Table 14 — Calmodulin antagonist peptide sequences 


Sequence/structure 


SEQ 
ID NO: 


SCVKWGKKEFCGS 


164 


SP.WKYWGKECGS 


165 


SCYEWGKLRWCGS 


166 


SCLRWGKWSNCGS 


167 


SCWRWGKYQICGS 


168 


SCVSWGALKLCGS 


169 


SCIRWGQNTFCGS 


170 


SCWQWGNLKICGS 


171 


SCVRWGQLSICGS 


172 


LKKFNARRKLKGAILTTMLAK 


173 


RRWKKNFIAVSAANRFKK 


174 


RKWQKTGHAVRAIGRLSS 


175 


INLKALAALAKKIL 


176 


KIWSILAPLGTTLVKLVA 


177 


LKKLLKLLKKLLKL 


178 


LKWKKLLKLLKKLLKKLL 


179 


AEWPSLTEI KTLSHFS V 


180 


AEWPSPTRVISTTYFGS 


181 


AELAHWPPVKTVLRSFT 


182 " 


AEGSWLQLLNLMKQMNN 


183 


AEWPSLTEiK 


184 



f0 
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Table 15— Mast cell antagonists/Mast cell protease inhibitor 



peptide sequences 



Sequence/structure 


SEO 
ID In\J: 


SGSGVLKRPLPILPVTR 


272 


RWLSSRPLPPLPLPPRT 


273 


GSGSYDTLALPSLPLHPMSS 


274 


GSGSYDTRALPSLPLHPMSS 




GSGSSGVTMYPKLPPHWSMA 


276 


GSGSSGVRMYPKLPPHWSMA 


277 


GSGSSSMRMVPTIPGSAKHG 


278 


RNR 


NR 


QT 


NR 


RQK 


NR 


NRQ 


NR 


RQK 


NR 


RNRQKT 


436 


RNRQ 


437 


RNRQK 


438 


NRQKT 


439 


RQKT 


440 
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Table 16— SH3 antagonist peptide sequences 



Sequence/structure 


ID NO: 


RPLPPLP 


282 


RELPPLP 


283 


SPLPPLP 


284 


GPLPPLP 


285 


RPLPIPP 


286 


RPLPIPP 


287 


RRLPPTP 


288 


RQLPPTP 


289 


RPLPSRP 


290 


RPLPTRP 


291 


SRLPPLP 


292 


RALPSPP 


293 


RRLPRTP 


294 


RPVPPIT 


295 


ILAPPVP 


296 


RPLPMLP 


297 


RPLPILP 


298 


RPLPSLP 


299 


RPLPSLP 


300 


RPLPMIP 


301 


RPLPLIP 


302 


RPLPPTP 


303 


RSLPPLP 


304 


RPQPPPP rr 


305 


RQLPIPP 


306 


XXXRPLPPLPXP 


307 


XXXRPLPP1PXX 


308 


XXXRPLPPLPXX 


309 


RXXRPLPPLPXP 




RXXRPLPPLPPP 


311 


pppyppppipxx 


312 


PPPYPPPPVPXX 


313 


LXXRPLPXW 


314 


*FXXRPLPXLP 


315 


PPX0XPPP*FP 


316 


+PPWXKPXWL 


317 


RPX^PW+SXP 


318 


PPVPPRPXXTL 


319 


vj/pvpLpxj/K 


320 


+0DXPLPXLP 


321 
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Table 17— Somatostatin or cortistatin mimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


v 1 _v 2 _Aon.Pho-Pho-Trn-I vft-Thr-Phe-X 3 -Ser-X 4 


473 


Acn Am Mot Pro Hvc. Am Asn Phe Phe Tro Lvs Thr Phe Ser Ser Cys Lys 


474 


Mot Pm r.v/Q Am Asn Phe Phe Tro Lvs Thr Phe Ser Ser Cys Lys 


475 


rvc Am Acn PhA Phe Tra Lvs Thr Phe Ser Ser Cys Lys 


476 


Acn Am Mot Prn Hv<t Am Asn Phe Phe Tro Lvs Thr Phe Ser Ser Cys 


477 


Mot Prn r.uc Am Asn Phe Phe Tro Lvs Thr Phe Ser Ser Cys 


478 


r.wc Am Acn Pho Pho Tm Lvs Thr Phe Ser Ser Cvs 


479 


Ac?n Am Mpt Pro Cvs I vs Asn Phft Phe Trp Lys Thr Phe Ser Ser Cys 


480 


Mot Pm rs/c i vc Asn Phe Phe Tro Lvs Thr Phe Ser Ser Cys Lys 


481 


Pv/q i w Acn Pho Pho Tro Lvs Thr Phe Ser Ser Cys Lys 


482 


Acn Am Mot Prn Ovs I vs Asn Phe Phe Tro Lvs Thr Phe Ser Ser Cys 


483 


Mat Dm fN/o i \/c Acn Pho Pho Trn Lvs Thr Phe Ser Ser Cvs 


484 


Cys Lys Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


485 


Asp Arq Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


486 


Met Pro Cys Arq Asn Phe Phe Trp Lvs Thr Phe Thr Ser Cys Lys 


487 


Cys Arq Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


488 


Asp Arg Met Pro Cys Arq Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


489 


Met Pro Cvs Arq Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


490 


Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


491 


Asp Arg Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


492 


Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


493 


Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


494 


Asp Arg Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


495 


Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


496 


Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


497 
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Table 18— UKR antagonist peptide sequences 



Sequence/structure 


SEQ 


ID NO: 


AFPMPUSI NFSOYLWYT 


196 


ApujYfiSI WDTYSPLAF 


197 


API ni WMRHYPLSFSNR 


198 


AF^i WTRYAWPSMPSY 


199 


a pwm Pfi LS FGS YLWSKT 


200 


AFPAl 1 IMWSFFFNPGLH 


201 


A FW^FYNI LH LP E POTI F 


202 


APPI ni WSLYSLPPLAM 


203 


appti WOI YOFPLRLSG 


204 


AEISFSELMWLRSTPAF 


205 


AELSEADLWTTWFGMGS 


206 


AESSLWRIFSPSALMMS 


207 


AESLPTLTSILWGKESV 


208 


AETLFMDLWH DKH 1 LLT 


209 


AEILNFPLWHEPLWSTE 


210 


AESQTGTLNTLFWNTLR 


211 


AEPVYQYELDSYLRSYY 


430 


AELDLSTFYDIQYLLRT 


431 


AEFFKLGPNGYVYLHSA 


432 


FKLXXXGYVYL 


433 


AESTYHHLSLGYMYTLN 


434 


YHXLXXGYMYT 


435 
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Table 19 — Macrophage and/or 



T-cell inhibiting peptide sequences 



Sequence/structure 




ID NO: 


Xaa-Yaa-Arq 


NR 


Arg-Yaa-Xaa 


NR 


Xaa-Arg-Yaa 


NR 


Yaa-Arg-Xaa 


NR 


Ala-Arg 


NR 


Arg-Arg 


NR 


Asn-Arg 


NR 


Asp-Arq 


NR 


Cys-Arg 


NR 


Gin-Am 


NR 


Glu-Arn 


NR 


Glv-Arn 


NR 


His-arn 


NR 


|je~Arg 


NR 


I pn-Arn 


NR 


1 v<;-Arn 


NR 


Mpt-Arn 


NR 




NR 


Sfir-Arn 


NR 


Thl*-Arg 


NR 


Tro-Ara 


NR 


r ? — 

Tvr-Arq 

' y H — — 


NR 


Val-Arg 


NR 


Ala-Glu-Arg 


NR 


Arg-Glu-Arg 


NR 


Asn-Glu-Ard 


NR 


Asp-Glu-Ara 


NR 


Cys-Glu-Arg 


NR 


Gln-Glu-Arg 


NR 


Glu-Glu-Arg 


NR 


Gly-Glu-Arg 


NR 


His-Glu-Arg 


NR 


lle-Glu-Arg 


NR 


Leu-Glu-Arg 


NR 


Lys-GIu-Arg 


NR 


Met-Glu-Arg 


NR 


Phe-Glu-Arg 


NR 


Pro-Glu-Arg 


NR 


Ser-Glu : Arq 


- NR 


Thr-Glu-Arg 


NR 


Trp-Glu-Arg 


NR 


Tyr-Glu-Arg 


NR 


Val-Glu-Arg 


NR 
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Am-. A la 


NR 


A m -Acn 


NR 


A r/i _0 v/c 


NR 


Arn-f^ln 


NR 


A rri.f^h i 


NR 


Am 


NR 


A r*rt_l— lio 


NR 


Arg-lle . „ — — . — 


NR 


Arg-Leu m .. 


NR 


Arg-Lys 


NR 


Arg-M6t _ 


NR 


Arg-Pne 


NR 


Arg-Pro . 


NR 


Arg-Ser . _ 


NR 


Arg-Thr : 


NR 


Arg-Trp 


NR 


Arg-Tyr _ „ . 


NR 


Arg-vai . 


NR 


Arg-\jiu-Aia _ „ 


NR 


Arg-Glu-Asn 


NR 


Arg-Glu-Asp ■ 


NR 


Arg-Glu-Cys 


NR 


Arg-Glu-GIn 


NR 


Arg-GIu-Glu 


NR 


Arg-Glu-Gly 


NR 


Arg-Glu-His 


NR 


Arg-GIu-lle 


NR 


Arg-Glu-Leu 


NR 


Arg-Glu-Lys - 


NR 


Arg-Glu-Met 


NR 


Arg-Glu-Phe 


NR 


Arg-Glu-Pro 


NR 


Arg-Glu-Ser 


NR 


Arg-GIu-Thr 


NR 


Arg-oiu- 1 rp 


NR 


Arg-Glu-Tyr 


NR 


Arg-oiu-vai 


NR 


A 1 n A wf lit 

Aia-Arg-^jiu . 


NR 


Arg-Arg-Glu 


NR 


Asn-Arg-Glu 


NR 


Asp-Arg-Glu 


NR 


Cys-Arq-Glu 


NR 


GIn-Arg-Glu 


NR 


Glu-Arg-Glu 


NR 


Gly-Arg-Glu 


NR 


His-Arg-Glu : — 


- NR 


lle-Arg-Glu 


NR 


Leu-Arq-Glu 


NR 


Lys-Arg-Glu 


NR 


Met-Arg-Glu 


NR 
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Php-Arn-Glu 


NR 


Prn-Arn-f^h 1 


NR 


£ar-Arn-f^lii 


NR 


Thr-Arn-f^li i 


NR 


Xrn-Arn -rtli i 


NR 


"T\/r- Arn-fnh i 


NR 


V/al-Am-f^ln 


NR 


/Till i.Arri-Ala 


NR 


flli i _ A rn _ A rn 


NR 


/"^ 1 1 i_Arv*«_Aon 


NR 


/"2 lit A rr\ Acn 

oiu-Mrg-Msp - — ■ 


NR 


/"ill ■ Am wo 


NR 


jjiu-Arg-vain — 


NR 


vjiu-Mrg-oiy _ . 


NR 




NR 


Glu-Arg-lle 


NR 


Glu-Arg-Leu 


NR 


Glu-Arg-Lys 


NR 


Glu-Ara-Met 


NR 


Glu-Arg-Phe 


NR 


Glu-Arg-Pro 


NR 


Glu-Arg-Ser 


NR 


Glu-Arg-Thr 


NR 


Glu-Arg-Trp 


NR 


Glu-Arg-Tyr 


NR 


Glu-Arg-Val 


NR 



*1 
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Table 20— Additional Exemplary Pharmacologically Active Peptides 



Sequence/structure 


SEQ 
ID 


Activity 


VEPNCDIHVMWEWECFERL 


luz/ 


v cor-aniagonisi 


GERWCFDGPLTWVCGEES 


• IUo4 


VtzVJi -an lay Oil lb I 


RGWVEICVADDNGMCVTEAQ 




vcvjir-aniayur iibi 


GWDECDVARMWEWECFAGV 


iUoO 


vtur- aniayunisi 


G E RWC FDG PRAWVCG WE 1 


501 


vtor- antagonist 


EELWCFDGPRAWVCGYVK 


502 


vtbr- antagonist 


RGWVEICAADDYGRCLTEAQ 


1031 


VEGF- antagonist 


RGWVEICESDVWGRCL 


1087 


VEGF- antagonist 


RGWVEICESDVWGRCL 


1088 


VEGF- antagonist 


GGNECDIARMWEWECFERL 


1089 


VEGF- antagonist 


RGWVEICAADDYGRCL 


1090 


VEGF-antagonist 


CTTHWGFTLC 


1028 


MMP inhibitor 


CLRSGXGC 


1091 


MMP inhibitor 


CXXHWGFXXC 


1092 


MMP inhibitor 


CXPXC 


1093 


MMP inhibitor 


CRRHWGFEFC 


1094 


MMP inhibitor 


STTHWGFTLS 


1095 


MMP inhibitor 


CSLHWGFWWC 


1096 


CTLA4-mimetic 


GFVCSGIFAVGVGRC 


125 


CTLA4-mimetic 


APGVRLGCAVLGRYC 


126 


CTLA4-mimetic 


LLGRMK 


105 


Antiviral (HBV) 


ICVVQDWGHHRCTAGHMANLTSHASAI 


127 


C3b antagonist 


ICVVQDWGHHRCT 


128 


C3b antagonist 


CVVQDWGHHAC 


129 


C3b antagonist 


STGGFDDVYDWARGVSSALTTTLVATR 


185 


Vinculin-binding 


STGGFDDVYDWARRVSSALTTTLVATR 


186 


Vinculin-binding 


SRGVNFSEWLYDMSAAMKEASNVFPSRRSR 


187 


Vinculin-binding 


SSQNWDMEAGVEDLTAAMLGLLSTIHSSSR 


188 


Vinculin-binding 


SSPSLYTQFLVNYESAATRIQDLLIASRPSR 


189 


Vinculin-binding 


SSTGWVDLLGALQRAADATRTSIPPSLQNSR 


190 


Vinculin-binding 


DVYTKKELIECARRVSEK 


191 


Vinculin-binding 


EKGSYYPGSGIAQFHIDYNNVS 


192 


o*H3p-Dinainy 


SG 1 AQFH 1 DYNN VSSAEGWH VN 


193 


C4BP-binding 


LVTVEKGSYYPGSGIAQFHIDYNNVSSAEGWHVN 


194 


C4BP-binding 


SGIAQFHIDYNNVS 


195 


C4BP-binding 


LLGRMK 


279 


anti-HBV 


ALLGRMKG 


280 


anti-HBV 


LDPAFR 


281 


anti-HBV 


CXXRGDC 


322 


Inhibition of platelet 
aggregation 


RPLPPLP 


323 


Src antagonist 


PPVPPR 


324 


Src antagonist 


XFXDXWXXLXX 


325 


Anti-cancer 
(particularly for 
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sarcomas) 


KACRRLFGPVDSEQLSRDCD 




yj \ \J llllll iCllv 


RERWNFDFVTETPLEGDFAW 


327 


p16-mimetic 


KRRQTSMTDFYHSKRRLIFS 


328 


p!6-mimetic 


TSMTDFYHSKRRL1FSKRKP 


329 


p1 6-mimetic 


RRLIF 


330 


p16-mimetic 


KRRQTSATDFYHSKRRUFSRQIKIWFQNRRMKWKK 


331 


p1 6-mimetic 


KRRLIFSKRQIKIWFQNRRMKWKK 


332 


p1 6-mimetic 


Asn Gin Gly Arg His Phe Cys Gly Gly Ala Leu lie His Ala 
Arq Phe Val Met Thr Ala Ala Ser Cvs Phe Gin 


498 


CAP37 mimetic/LPS 
bindinq 


Arg His Phe Cys Gly Gly Ala Leu lie His Ala Arg Phe Val 
Met Thr Ala Ala Ser Cvs 


499 


CAP37 mimetic/LPS 
bindinq 


Gly Thr Arg Cys Gin Val Ala Gly Trp Gly Ser Gin Arg Ser 
Gly Gly Arg Leu Ser Arg Phe Pro Arg Phe Val Asn Val 


500 


CAP37 mimetic/LPS 
binding 


WHWRHRIPLQLAAGR 


in97 


Mrbnhvdrate fGD1 
alpha) mimetic 


LKTPRv 


1098 


P2GPI Ab binding 


NTLKTPRV 


1099 


(32GPI Ab binding 


NTLKTP R VGGC 


i inn 




KDKATF 


1101 


02GPI Ab binding 


KDKATFGCHD 


1102 


P2GPI Ab binding 


KDKATFGCHDGC 


1103 


32GPI Ab binding 


TLRVYK 


1104 


02GPI Ab binding 


ATLRVYKGG 


1105 


02GPI Ab binding 


CATLRVYKGG 


1106 


(32GPI Ab binding 


INLKALAALAKKIL 


1107 


Membrane- 
transportinq 


GWT 


NR 


Membrane- 
transportinq 


GWTLNSAGYLLG 


1108 


Membrane- 
transoortinq 


GWTLNSAGYLLGKINLKALAALAKKIL 


1109 


Membrane- 
transoortinq 


The present invention is also particularly useful with peptides 



having activity in treatment of: 

• cancer, wherein the peptide is a VEGF-mimetic or a VEGF receptor 
antagonist, a HER2 agonist or antagonist, a CD20 antagonist and the 
like; 

• asthma, wherein the protein of interest is a CKR3 antagonist, an IL-5 
receptor antagonist, and the like; ■ — 

• thrombosis, wherein the protein of interest is a GPIIb antagonist, a 
GPIIIa antagonist, and the like; 
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• autoimmune diseases and other conditions involving immune 
modulation, wherein the protein of interest is an IL-2 receptor 
antagonist, a CD40 agonist or antagonist, a CD40L agonist or 
antagonist, a thymopoietin mimetic and the like. 
> Vehicles . This invention requires the presence of at least one vehicle 

(F 1 , F 2 ) attached to a peptide through the N-terminus, C-terminus or a 
sidechain of one of the amino acid residues. Multiple vehicles may also be 
used; e.g., Fc's at each terminus or an Fc at a terminus and a PEG group at 
the other terminus or a sidechain. 
D An Fc domain is the preferred vehicle. The Fc domain may be fused 

to the N or C termini of the peptides or at both the N and C termini. For 
the TPO-mimetic peptides, molecules having the Fc domain fused to the N 
terminus of the peptide portion of the molecule are more bioactive than 
other such fusions, so fusion to the N terminus is preferred. 
5 As noted above, Fc variants are suitable vehicles within the scope of 

this invention. A native Fc may be extensively modified to form an Fc 
variant in accordance with this invention, provided binding to the salvage 
receptor is maintained; see, for example WO 97/34631 and WO 96/32478. 
In such Fc variants, one may remove one or more sites of a native Fc that 
0 provide structural features or functional activity not required by the 
fusion molecules of this invention. One may remove these sites by, for 
example, substituting or deleting residues, inserting residues into the site, 
or truncating portions containing the site. The inserted or substituted 
residues may also be altered amino acids, such as peptidomimetics or D- 
5 amino acids. Fc variants may be desirable for a number of reasons, several 
of which are described below. Exemplary Fc variants include molecules 
and sequences in which: 

1 . Sites involved in disulfide bond formation are removed. Such removal 
may avoid reaction with other cysteine-containing proteins present in 

(ft) 
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the host cell used to produce the molecules of the invention. For this 
purpose, the cysteme^ontaining segment at the N-terminus may be 
truncated or cysteine residues may be deleted or substituted with other 
amino acids (e.g., alanyl, seryl). In particular, one may truncate the N- 
5 terminal 20-amino acid segment of SEQ ID NO: 2 or delete or 

substitute the cysteine residues at positions 7 and 10 of SEQ ID NO: 2. 
Even when cysteine residues are removed, the single chain Fc domains 
can still form a dimeric Fc domain that is held together non-covalently. 

2. A native Fc is modified to make it more compatible with a selected host 
L 0 cell. For example, one may remove the PA sequence near the N- 

terminus of a typical native Fc, which may be recognized by a digestive 
enzyme in E. coli such as proline iminopeptidase. One may also add an 
N-terminal methionine residue, especially when the molecule is 
expressed recombinantly in a bacterial cell such as E. coli. The Fc 
1 5 domain of SEQ ID NO: 2 (Figure 4) is one such Fc variant. 

3. A portion of the N-terminus of a native Fc is removed to prevent N- 
terminal heterogeneity when expressed in a selected host cell. For this 
purpose, one may delete any of the first 20 amino acid residues at the 
N-terminus, particularly those at positions 1, 2, 3, 4 and 5. 

20 4. One or more glycosylation sites are removed. Residues that are 

typically glycosylated (e.g., asparagine) may confer cytolytic response. 
Such residues may be deleted or substituted with unglycosylated 
residues (e.g., alanine). 
5. Sites involved in interaction with complement, such as the Clq binding 

2 5 site, are removed. For example, one may delete or substitute the EKK 
sequence of human IgGl. Complement recruitment may not be 
advantageous for the molecules of this invention and so may be 
avoided with such an Fc variant. 
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6. Sites are removed that affect binding to Fc receptors other than a 
salvage receptor. A native Fc may have sites for interaction with 
certain white blood cells that are not required for the fusion molecules 
of the present invention and so may be removed. 
5 7. The ADCC site is removed. ADCC sites are known in the art; see, for 

example, Mnlec. Immunol . 29 (5): 633-9 (1992) with regard to ADCC 
sites in IgGl. These sites, as well, are not required for the fusion 
molecules of the present invention and so may be removed. 
8. When the native Fc is derived from a non-human antibody, the native 

10 Fc may be humanized. Typically, to humanize a native Fc, one will 

substitute selected residues in the non-human native Fc with residues 
that are normally found in human native Fc. Techniques for antibody 
humanization are well known in the art. 

Preferred Fc variants include the following. In SEQ ID NO: 2 
15 (Figure 4) the leucine at position 15 may be substituted with glutamate; the 
glutamate at position 99, with alanine; and the lysines at positions 101 and 
103, with alanines. In addition, one or more tyrosine residues can be 
replaced by phenyalanine residues. 

An alternative vehicle would be a protein, polypeptide, peptide, 
2 0 antibody, antibody fragment, , or small molecule (e.g., a peptidomimetic 
compound) capable of binding to a salvage receptor. For example, one 
could use as a vehicle a polypeptide as described in U.S. Pat. No. 5,739,277, 
issued April 14, 1998 to Presta etal. Peptides could also be selected by 
phage display for binding to the FcRn salvage receptor. Such salvage 
2 5 receptor-binding compounds are also included within the meaning of 

"vehicle" and are within the scope of this invention. Such vehicles should 
be selected for increased half-life (e.g., by avoiding sequences recognized 
by proteases) and decreased immunogenicity (e.g., by favoring non- 
immunogenic sequences, as discovered in antibody humanization). 

iff 3* 
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As noted above, polymer vehicles may also be used for F 1 and F 2 . 
Various means for attaching chemical moieties useful as vehicles are 
currently available, see, e.g., Patent Cooperation Treaty ("PCT") 
International Publication No. WO 96/11953, entitled "N-Terminally 
5 Chemically Modified Protein Compositions and Methods," herein 

incorporated by reference in its entirety. This PCT publication discloses, 
among other things, the selective attachment of water soluble polymers to 
the N-terminus of proteins. 

A preferred polymer vehicle is polyethylene glycol (PEG). The PEG 
1 0 group may be of any convenient molecular weight and may be linear or 
branched. The average molecular weight of the PEG will preferably range 
from about 2 kiloDalton ("kD") to about 100 kDa, more preferably from 
about 5 kDa to about 50 kDa, most preferably from about 5 kDa to about 
10 kDa. The PEG groups will generally be attached to the compounds of 
15 the invention via acylation or reductive alkylation through a reactive 

group on the PEG moiety (e.g., an aldehyde, amino, thiol, or ester group) 
to a reactive group on the inventive compound (e.g., an aldehyde, amino, 
or ester group). 

A useful strategy for the PEGylation of synthetic peptides consists 
2 0 of combining, through forming a conjugate linkage in solution, a peptide 
and a PEG moiety, each bearing a special functionality that is mutually 
reactive toward the other. The peptides can be easily prepared with 
conventional solid phase synthesis (see, for example, Figures 5 and 6 and 
the accompanying text herein). The peptides are "preactivated" with an 
2 5 appropriate functional group at a specific site. The precursors are purified 
and fully characterized prior to reacting with the PEG moiety. Ligation of 
the peptide with PEG usually takes place in aqueous phase and can be 
easily monitored by reverse phase analytical HPLC. The PEGylated 
peptides can be easily purified by preparative HPLC and characterized by 
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analytical HPLC, amino acid analysis and laser desorption mass 
spectrometry. 

Polysaccharide polymers are another type of water soluble polymer 
which may be used for protein modification. Dextrans are polysaccharide 
5 polymers comprised of individual subunits of glucose predominantly 
linked by al-6 linkages. The dextran itself is available in many molecular 
weight ranges, and is readily available in molecular weights from about 1 
kD to about 70 kD. Dextran is a suitable water soluble polymer for use in 
the present invention as a vehicle by itself or in combination with another 
10 vehicle (e.g., Fc). See, for example, WO 96/11953 and WO 96/05309. The 
use of dextran conjugated to therapeutic or diagnostic immunoglobulins 
has been reported; see, for example, European Patent Publication No. 0 
315 456, which is hereby incorporated by reference. Dextran of about 1 kD 
to about 20 kD is preferred when dextran is used as a vehicle in 
1 5 accordance with the present invention. 

Linkers . Any "linker" group is optional. When present, its chemical 
structure is not critical, since it serves primarily as a spacer. The linker is 
preferably made up of amino acids linked together by peptide bonds. 
Thus, in preferred embodiments, the linker is made up of from 1 to 20 
2 0 amino acids linked by peptide bonds, wherein the amino acids are selected 
from the 20 naturally occurring amino acids. Some of these amino acids 
may be glycosylated, as is well understood by those in the art. In a more 
preferred embodiment, the 1 to 20 amino acids are selected from glycine, 
alanine, proline, asparagine, glutamine, and lysine. Even more preferably, 
2 5 a linker is made up of a majority of amino acids that are sterically 

unhindered, such as glycine and alanine. Thus, preferred linkers are 
poiyglycines (particularly (Gly) 4 , (Gly) 5 ), poly(Gly-Ala), and polyalanines. 
Other specific examples of linkers are: 

(Gly) 3 Lys(Gly) 4 (SEQ ID NO: 333); 
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10 



15 



(Gly) 3 AsnGlySer(Gly) 2 (SEQ ID NO: 334); 
(Gly) 3 Cys(Gly) 4 (SEQ ID NO: 335); and 

GlyProAsnGlyGly (SEQ ID NO: 336). 
To explain the above nomenclature, for example, (Gly) 3 Lys(Gly) 4 means 
Gly-Gly-Gly-Lys-Gly-Gly-Gly-Gly- Combinations of Gly and Ala are also 
preferred. The linkers shown here are exemplary; linkers within the scope 
of this invention may be much longer and may include other residues. 

Non-peptide linkers are also possible. For example, alkyl linkers 
such as -NH-(CH 2 ) s -C(0)-, wherein s = 2-20 could be used. These alkyl 
linkers may further be substituted by any non-sterically hindering group 
such as lower alkyl (e.g., C,-C 6 ) lower acyl, halogen (e.g., CI, Br), CN, NH,, 
phenyl, etc. An exemplary non-peptide linker is a PEG linker, 
VI 

O 




wherein n is such that the linker has a molecular weight of 100 to 5000 kD, 
preferably 100 to 500 kD. The peptide linkers may be altered to form 
derivatives in the same manner as described above. 

Derivatives . The inventors also contemplate derivatizing the 
2 0 peptide and/or vehicle portion of the compounds. Such derivatives may 
improve the solubility, absorption, biological half life, and the like of the 
compounds. The moieties may alternatively eliminate or attenuate any 
undesirable side-effect of the compounds and the like. Exemplary 
derivatives include compounds in which: ^ 
25 1. The compound or some portion thereof is cyclic. For example, the 

peptide portion may be modified to contain two or more Cys residues 
(e.g., in the linker), which could cydize by disulfide bond formation. 
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10 



25 



For citations to references on preparation of cyclized derivatives, see 
Table 2. 

2. The compound is cross-linked or is rendered capable of cross-linking 
between molecules. For example, the peptide portion may be modified 
to contain one Cys residue and thereby be able to form an 
intermolecular disulfide bond with a like molecule. The compound 
may also be cross-linked through its C-terminus, as in the molecule 
shown below. 

vn 

F 1 -(X 1 jb-caN^-^y^N^ 

F 1 -(X 1 ) b -CON_^NH 
3. ° 

4 . One or more peptidyl [-C(C0NR-] linkages (bonds) is replaced by a 
non-peptidyl linkage. Exemplary non-peptidyl linkages are -CH 2 - 
carbamate [-CH 2 -OC(0)NR-], phosphonate , -^-sulfonamide [-CH,- 
S(0) 2 NR-], urea [-NHC(0)NH-], -CH 2 -secondary amine, and alkylated 
1 5 peptide [-C(0)NR 4 - wherein R* is lower alkyl]. 

5. The N-terminus is derivatized. Typically, the N-terminus may be 
acylated or modified to a substituted amine. Exemplary N-terminal 
derivative groups include -NRR 1 (other than -NH 2 ), -NRC(0)R 1 , 
-NRC(0)OR 1 , -NRS(0) 2 R\ -NHC(0)NHR', succinimide, or 

20 benzyloxycarbonyl-NH- (CBZ-NH-), wherein R and R 1 are each 

independently hydrogen or lower alkyl and wherein the phenyl ring 
may be substituted with 1 to 3 substituents selected from the group 
consisting of C,-C 4 alkyl, C r C 4 alkoxy, chloro, and bromo. 

6. The free C-terminus is derivatized. Typically, the C-terminus is 
esterified or amidated. For example, one may use methods described in 
the art to add (NH-CH 2 -CH 2 -NH 2 ) 2 to compounds of this invention 
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having any of SEQ ID NOS: 504 to 508 at the C-terminus. Likewise, 
one may use methods described in the art to add -NH, to compounds 
of this invention having any of SEQ ID NOS: 924 to 955, 963 to 972, 
1005 to 1013, or 1018 to 1023 at the C-terminus. Exemplary C-terminal 
5 derivative groups include, for example, -C(0)R 2 wherein R 2 is lower 

alkoxy or -NR 3 R 4 wherein R 3 and R 4 are independently hydrogen or C r 
C 8 alkyl (preferably C,-C 4 alkyl). 

7. A disulfide bond is replaced with another, preferably more stable, 
cross-linking moiety (e.g., an alkylene). See, e.g., Bhatnagar etaL 

1 o (1996), T. Med. Chem . 39: 3814-9; Alberts etal. (1993) Thirteenth Am. 

Pep. Svmp ., 357-9. 

8. One or more individual amino acid residues is modified. Various 
derivatizing agents are known to react specifically with selected 
sidechains or terminal residues, as described in detail below. 

1 5 Lysinyl residues and amino terminal residues may be reacted with 

succinic or other carboxylic acid anhydrides, which reverse the charge of the 
lysinyl residues. Other suitable reagents for derivaozing alpha-amino- 
containing residues include imidoesters such as methyl picolmimidate; 
pyridoxal phosphate; pyridoxal; chloroborohydride; trinitrobenzenesulfonic 

2 0 acid; O-methylisourea; 2,4 pentanedione; and transaminase-catalyzed reaction 

with glyoxylate. 

Arginyl residues may be modified by reaction with any one or 
combination of several conventional reagents, including phenylglyoxal, 2,3- 
butanedione, 1,2-cyclohexanedione, and ninhydrin. Derivatization of arginyl 
2 5 residues requires that the reaction be performed in alkaline conditions because 
of the high pKa of the guanidine functional group. Furthermore, these reagents 
may react with the groups of lysine as well as the argmine-epsilon-amino - 
group. 
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Specific modification of tyrosyl residues has been studied extensively, 
with particular interest in introducing spectral labels into tyrosyl residues by 
reaction with aromatic diazonium compounds or tetranitromethane. Most 
commonly, N-acetylimidizole and tetranitromethane are used to form O-acetyl 
5 tyrosyl species and 3-nitro derivatives, respectively. 

Carboxyl sidechain groups (aspartyl or glutamyl) may be selectively 
modified by reaction with carbodiimides (R'-N=C=N-R') such as 1-cyclohexyl- 
3-(2-morpholinyl-(4-ethyl) carbodiimide or l-ethyl-3-(4-azonia-4,4- 
dimethylpentyl) carbodiimide. Furthermore, aspartyl and glutamyl residues 

1 o may be converted to asparaginyl and glutaminyl residues by reaction with 

ammonium ions. 

Glutaminyl and asparaginyl residues may be deamidated to the 
corresponding glutamyl and aspartyl residues. Alternatively, these residues 
are deamidated under mildly acidic conditions. Either form of these residues 
1 5 falls within the scope of this invention. 

Cysteinyl residues can be replaced by amino acid residues or other 
moieties either to eliminate disulfide bonding or, conversely, to stabilize cross- 
linking. See, e.g., Bhatnagar etal. (1996), T-Med.Chem. 39: 3814-9. 

Derivatization with bifunctional agents is useful for cross-linking the 

2 0 peptides or their functional derivatives to a water-insoluble support matrix or 

to other macromolecular vehicles. Commonly used cross-linking agents 
include, e.g., l>bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N- 
hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, 
homobifunctional imidoesters, including disuccinimidyl esters such as 3^'- 
ditmobis(succinimidylpropionate), and bifunctional maleimides such as bis-N- 
maleimido-l,8-octane. Derivatizing agents such as methyl-3-[(p- 
azidophenyDdithiolpropioimidate yield photoactivatable intermediates that are 
capable of forming crosslinks in the presence of light. Alternatively, reactive 
water-insoluble matrices such as cyanogen bromide-activated carbohydrates 
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and the reactive substrates described in U.S. Pat. Nos. 3,969,287; 3,691,016; 
4,195,128; 4,247,642; 4,229,537; and 4,330,440 are employed for protein 

immobilization. 

Carbohydrate (oligosaccharide) groups may conveniently be 
attached to sites that are known to be glycosylation sites in proteins. 
Generally, O-linked oligosaccharides are attached to serine (Ser) or 
threonine (Thr) residues while N-linked oligosaccharides are attached to 
asparagine (Asn) residues when they are part of the sequence Asn-X- 
Ser/Thr, where X can be any amino acid except proline. X is preferably 
one of the 19 naturally occurring amino acids other than proline. The 
structures of N-linked and O-linked oligosaccharides and the sugar 
residues found in each type are different. One type of sugar that is 
commonly found on both is N-acetymeuraminic acid (referred to as sialic 
acid). Sialic acid is usually the terminal residue of both N-linked and O- 
1 5 linked oligosaccharides and, by virtue of its negative charge, may confer 
acidic properties to the glycosylated compound. Such site(s) may be 
incorporated in the linker of the compounds of this invention and are 
preferably glycosylated by a cell during recombinant production of the 
polypeptide compounds (e.g., in mammalian cells such as CHO, BHK, 
20 COS). However, such sites may further be glycosylated by synthetic or 

semi-synthetic procedures known in the art. 

Other possible modifications include hydroxylation of proline and 
lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, 
oxidation of the sulfur atom in Cys, methylation of the alpha-amino 
2 5 groups of lysine, arginine, and histidine side chains. Creighton, Proteins: 
fihurhm* and Mnipmle Properties (W. H. Freeman & Co., San Francisco), 
pp. 79-86 (1983). 

Compounds of the present invention may be changed at the DNA 
level, as well. The DNA sequence of any portion of the compound may be 
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changed to codons more compatible with the chosen host cell. For Rcoli, 
which is the preferred host cell, optimized codons are known in the art. 
Codons may be substituted to eUminate restriction sites or to include silent 
restriction sites, which may aid in processing of the DNA in the selected 
host cell. The vehicle, linker and peptide DNA sequences may be modified 
to include any of the foregoing sequence changes. 
Methods of Making 

The compounds of this invention largely may be made in 
transformed host cells using recombinant DNA techniques. To do so, a 
recombinant DNA molecule coding for the peptide is prepared. Methods 
of preparing such DNA molecules are well known in the art. For instance, 
sequences coding for the peptides could be excised from DNA using 
suitable restriction enzymes. Alternatively, the DNA molecule could be 
synthesized usmg chemical synthesis techniques, such as the 
1 5 phosphoramidate method. Also, a combination of these techniques could 
be used. 

The invention also includes a vector capable of expressing the 
peptides in an appropriate host. The vector comprises the DNA molecule 
that codes for the peptides operatively linked to appropriate expression 
2 0 control sequences. Methods of effecting this operative linking, either 
before or after the DNA molecule is inserted into the vector, are well 
known. Expression control sequences include promoters, activators, 
enhancers, operators, ribosomal binding sites, start signals, stop signals, 
cap signals, polyadenylation signals, and other signals involved with the 
2 5 control of transcription or translation. 

The resulting vector having the DNA molecule thereon is used to 
transform an appropriate host. This transformation may be performed 
using methods well known in the art. 
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Any of a large number of available and well-known host cells may 
be used in the practice of this invention. The selection of a particular host 
is dependent upon a number of factors recognized by the art. These 
include, for example, compatibility with the chosen expression vector, 
5 toxicity of the peptides encoded by the DNA molecule, rate of 

transformation, ease of recovery of the peptides, expression characteristics, 
bio-safety and costs. A balance of these factors must be struck with the 
understanding that not all hosts may be equally effective for the 
expression of a particular DNA sequence. Within these general guidelines, 
1 0 useful microbial hosts include bacteria (such as E. coli sp.), yeast (such as 
Saccharomvces sp.) and other fungi, insects, plants, mammalian (including 
human) cells in culture, or other hosts known in the art. 

Next, the transformed host is cultured and purified. Host cells may 
be cultured under conventional fermentation conditions so that the 
1 5 desired compounds are expressed. Such fermentation conditions are well 
known in the art. Finally, the peptides are purified from culture by 
methods well known in the art. 

The compounds may also be made by synthetic methods. For 
example, solid phase synthesis techniques may be used. Suitable 
2 0 techniques are well known in the art, and include those described in 
Merrifield (1973), Chem. Polypeptides, pp. 335-61 (Katsoyannis and 
Panayotis eds.); Merrifield (1963), T. Am. Chem. Soc . 85: 2149; Davis etal. 
(1985), Biochem. Intl . 10: 394-414; Stewart and Young (1969), Solid Phase 
Peptide Synthesis; U.S. Pat. No. 3,941,763; Finn etal. (1976), The Proteins 
2 5 (3rd ed.) 2: 105-253; and Erickson etal. (1976), The Proteins (3rd ed.) 2: 
257-527. Solid phase synthesis is the preferred technique of making 
individual peptides since it is the most cost-effective method of making 
small peptides. 
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Compounds that contain derivatized peptides or which contain 
non-peptide groups may be synthesized by well-known organic chemistry 
techniques. 
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Uses of the Compounds 

In general . The compounds of this invention have pharmacologic 
activity resulting from their ability to bind to proteins of interest as 
agonists, mimetics or antagonists of the native ligands of such* proteins of 
5 interest. The utility of specific compounds is shown in Table 2. The activity 
of these compounds can be measured by assays known in the art. For the 
TPO-mimetic and EPO-mimetic compounds, in vivo assays are further 
described in the Examples section herein. 

In addition to therapeutic uses, the compounds of the present 

1 0 invention are useful in diagnosing diseases characterized by dysfunction 
of their associated protein of interest. In one embodiment, a method of 
detecting in a biological sample a protein of interest (e.g., a receptor) that 
is capable of being activated comprising the steps of: (a) contacting the 
sample with a compound of this invention; and (b) detecting activation of 

1 5 the protein of interest by the compound. The biological samples include 
tissue specimens, intact cells, or extracts thereof. The compounds of this 
invention may be used as part of a diagnostic kit to detect the presence of 
their associated proteins of interest in a biological sample. Such kits 
employ the compounds of the invention having an attached label to allow 

2 0 for detection. The compounds are useful for identifying normal or 
abnormal proteins of interest. For the EPO-mimetic compounds, for 
example, presence of abnormal protein of interest in a biological sample 
may be indicative of such disorders as Diamond Blackfan anemia, where it 
is believed that the EPO receptor is dysfunctional. 

2 5 Therapeutic uses of EPO-mimetic compounds . The EPO-mimetic 

compounds of the invention are useful for treating disorders characterized 
by low red blood cell levels. Included in the invention are methods of 
modulating the endogenous activity of an EPO receptor in a mammal, 
preferably methods of increasing the activity of an EPO receptor. In 

13 
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general, any condition treatable by erythropoietin, such as anemia, may 
also be treated by the EPO-mimetic compounds of the invention. These 
compounds are administered by an amount and route of delivery that is 
appropriate for the nature and severity of the condition being treated and 
5 may be ascertained by one skilled in the art. Preferably, administration is 
by injection, either subcutaneous, intramuscular, or intravenous. 

Therapeutic uses of TPO-mimetic compounds . FortheTPO- 
mimetic compounds, one can utilize such standard assays as those 
described in W095/26746 entitled "Compositions and Methods for 
1 0 Stimulating Megakaryocyte Growth and Differentiation". In vivo assays 
also appear in the Examples hereinafter. 

The conditions to be treated are generally those that involve an 
existing megakaryocyte/platelet deficiency or an expected 
megakaryocyte/platelet deficiency (e.g., because of planned surgery or 
1 5 platelet donation) . Such conditions will usually be the result of a 

deficiency (temporary or permanent) of active Mpl ligand in vivo. The 
generic term for platelet deficiency is thrombocytopenia, and hence the 
methods and compositions of the present invention are generally available 
for treating thrombocytopenia in patients in need thereof. 
2 0 Thrombocytopenia (platelet deficiencies) may be present for 

various reasons, including chemotherapy and other therapy with a variety 
of drugs, radiation therapy, surgery, accidental blood loss, and other 
specific disease conditions. Exemplary specific disease conditions that 
involve thrombocytopenia and may be treated in accordance with this 
2 5 invention are: aplastic anemia, idiopathic thrombocytopenia, metastatic 
tumors which result in thrombocytopenia, systemic lupus erythematosus, 
splenomegaly, Fanconi's syndrome, vitamin B12 deficiency; folic acid 
deficiency, May-Hegglin anomaly, Wiskott-Aldrich syndrome, and 
paroxysmal nocturnal hemoglobinuria. Also, certain treatments for AIDS 
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result in thrombocytopenia (e.g., AZT). Certain wound healing disorders 
might also benefit from an increase in platelet numbers. 

With regard to anticipated platelet deficiencies, e.g., due to future 
surgery, a compound of the present invention could be administered 
5 several days to several hours prior to the need for platelets. With regard 
to acute situations, e.g., accidental and massive blood loss, a compound of 
this invention could be administered along with blood or purified 
platelets. 

The TPO-mimetic compounds of this invention may also be useful in 

1 o stimulating certain cell types other than megakaryocytes if such cells are found 

to express Mpl receptor. Conditions associated with such cells that express the 
Mpl receptor, which are responsive to stimulation by the Mpl ligand, are also 
within the scope of this invention. 

The TPO-mimetic compounds of this invention may be used in any 
1 5 situation in which production of platelets or platelet precursor cells is desired, 
or in which stimulation of the c-Mpl receptor is desired. Thus, for example, the 
compounds of this invention may be used to treat any condition in a mammal 
wherein there is a need of platelets, megakaryocytes, and the like. Such 
conditions are described in detail in the following exemplary sources: 

2 0 W095 /26746; W095/21919; W095/18858; WO95/21920 and are incorporated 

herein. 

The TPO-mimetic compounds of this invention may also be useful in 
maintaining the viability or storage life of platelets and/or megakaryocytes and 
related cells. Accordingly, it could be useful to include an effective amount of 
2 5 one or more such compounds in a composition containing such cells. 

The therapeutic methods, compositions and compounds of the 
present invention may also be employed, alone or in combination with 
other cytokines, soluble Mpl receptor, hematopoietic factors, interleukins, 
growth factors or antibodies in the treatment of disease states 
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characterized by other symptoms as well as platelet deficiencies. It is 
anticipated that the inventive compound will prove useful in treating 
some forms of thrombocytopenia in combination with general stimulators 
of hematopoiesis, such as IL-3 or GM-CSF. Other megakaryocy tic 

5 stimulatory factors, i.e., meg-CSF, stem cell factor (SCF), leukemia 
inhibitory factor (LDF), oncostatin M (OSM), or other molecules with 
megakaryocyte stimulating activity may also be employed with Mpl 
ligand. Additional exemplary cytokines or hematopoietic factors for such 
co-administration include IL-1 alpha, IL-1 beta, IL-2, IL-3, IL-4, IL-5, IL-6, 

0 IL-11, colony stimulating factor-1 (CSF-1), SCF, GM-CSF, granulocyte 
colony stimulating factor (G-CSF), EPO, interferon-alpha (IFN-alpha), 
consensus interferon, IFN-beta, or IFN-gamma. It may further be useful to 
administer, either simultaneously or sequentially, an effective amount of a 
soluble mammalian Mpl receptor, which appears to have an effect of 

5 causing megakaryocytes to fragment into platelets once the 

megakaryocytes have reached mature form. Thus, administration of an 
inventive compound (to enhance the number of mature megakaryocytes) 
followed by administration of the soluble Mpl receptor (to inactivate the 
ligand and allow the mature megakaryocytes to produce platelets) is 

0 expected to be a particularly effective means of stimulating platelet 

production. The dosage recited above would be adjusted to compensate 
for such additional components in the therapeutic composition. Progress 
of the treated patient can be monitored by conventional methods. 

In cases where the inventive compounds are added to compositions 

5 of platelets and /or megakaryocytes and related cells, the amount to be 
included will generally be ascertained experimentally by techniques and 
assays known in the art. An exemplary range of amounts is~0.1 ug— 1 mg 
inventive compound per 10 6 cells. 
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Pharmaceutical Compositions 

In General , The present invention also provides methods of using 
pharmaceutical compositions of the inventive compounds. Such 
pharmaceutical compositions may be for administration for injection, or for 

5 oral, pulmonary, nasal, transdermal or other forms of administration. In 

general, the invention encompasses pharmaceutical compositions comprising 
effective amounts of a compound of the invention together with 
pharmaceutical^ acceptable diluents, preservatives, solubilizers, emulsifiers, 
adjuvants and/or carriers. Such compositions include diluents of various 

0 buffer content (e.g., Tris-HCl, acetate, phosphate), pH and ionic strength; 
additives such as detergents and solubilizing agents (e.g., Tween 80, 
Polysorbate 80), anti-oxidants (e.g., ascorbic acid, sodium metabisulfite), 
preservatives (e.g., Thimersol, benzyl alcohol) and bulking substances (e.g., 
lactose, mannitol); incorporation of the material into particulate preparations of 

5 polymeric compounds such as polylactic acid, polyglycolic acid, etc. or into 
liposomes. Hyaluronic acid may also be used, and this may have the effect of 
promoting sustained duration in the circulation. Such compositions may 
influence the physical state, stability, rate of in vivo release, and rate of in vivo 
clearance of the present proteins and derivatives. See, e.g., Remington's 

0 Pharmaceutical Sciences, 18th Ed. (1990, Mack Publishing Co., Easton, PA 
18042) pages 1435-1712 which are herein incorporated by reference. The 
compositions may be prepared in liquid form, or may be in dried powder, such 
as lyophilized form. Implantable sustained release formulations are also 
contemplated, as are transdermal formulations. 

5 Oral dosage forms . Contemplated for use herein are oral solid 

dosage forms, which are described generally in Chapter 89 of Remington's 
Pharmaceutical Sciences (1990), 18th Ed., Mack Publishing Co. Easton PA - 
18042, which is herein incorporated by reference. Solid dosage forms 
include tablets, capsules, pills, troches or lozenges, cachets or pellets. Also, 

I 9 ) 
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liposomal or proteinoid encapsulation may be used to formulate the 
present compositions (as, for example, proteinoid microspheres reported 
in U.S. Patent No. 4,925,673). Liposomal encapsulation may be used and 
the liposomes may be derivatized with various polymers (e.g., U .S. Patent 
5 No. 5,013,556) . A description of possible solid dosage forms for the 

therapeutic is given in Chapter 10 of Marshall, K., Modern Pharmaceutics 
(1979), edited by G. S. Banker and C. T. Rhodes, herein incorporated by 
reference. In general, the formulation will include the inventive 
compound, and inert ingredients which allow for protection against the 
1 0 stomach environment, and release of the biologically active material in the 
intestine. 

Also specifically contemplated are oral dosage forms of the above 
inventive compounds. If necessary, the compounds may be chemically 
modified so that oral delivery is efficacious. Generally, the chemical 

1 5 modification contemplated is the attachment of at least one moiety to the 
compound molecule itself, where said moiety permits (a) inhibition of 
proteolysis; and (b) uptake into the blood stream from the stomach or 
intestine. Also desired is the increase in overall stability of the compound 
and increase in circulation time in the body. Moieties useful as covalently 

2 0 attached vehicles in this invention may also be used for this purpose. 
Examples of such moieties include: PEG, copolymers of ethylene glycol 
and propylene glycol, carboxymethyl cellulose, dextran, polyvinyl alcohol, 
polyvinyl pyrrolidone and polyproline. See, for example, Abuchowski and 
Davis, Soluble Polvmer-Enzvme Adducts . Enzvmes as Drugs (1981), 

2 5 Hocenberg and Roberts, eds., Wiley-Interscience, New York, NY, , pp 367- 
83; Newmark, etal. (1982), T. Appl. Biochem . 4:185-9. Other polymers that 
could be used are poly-l,3-dioxolane and poly-l,3,6-tioxocane. Preferred 
for pharmaceutical usage, as indicated above, are PEG moieties. 

It 
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For oral delivery dosage forms, it is also possible to use a salt of a 
modified aliphatic amino acid, such as sodium N-(8-[2-hydroxybenzoyl] 
amino) caprylate (SNAC), as a carrier to enhance absorption of the 
therapeutic compounds of this invention. The clinical efficacy of a heparin 
5 formulation using SNAC has been demonstrated in a Phase II trial 

conducted by Emisphere Technologies. See US Patent No. 5,792,451, "Oral 
drug delivery composition and methods''. 

The compounds of this invention can be included in the 
formulation as fine multiparticulates in the form of granules or pellets of 

1 0 particle size about 1 mm. The formulation of the material for capsule 
administration could also be as a powder, lightly compressed plugs or 
even as tablets. The therapeutic could be prepared by compression. 

Colorants and flavoring agents may all be included. For example, 
the protein (or derivative) may be formulated (such as by liposome or 

1 5 microsphere encapsulation) and then further contained within an edible 
product, such as a refrigerated beverage containing colorants and 
flavoring agents. 

One may dilute or increase the volume of the compound of the 
invention with an inert material. These diluents could include 

2 0 carbohydrates, especially mannitol, a-lactose, anhydrous lactose, cellulose, 
sucrose, modified dextrans and starch. Certain inorganic salts may also be 
used as fillers including calcium triphosphate, magnesium carbonate and 
sodium chloride. Some commercially available diluents are Fast-Flo, 
Emdex, STA-Rx 1500, Emcompress and Avicell. 

2 5 Disintegrants may be included in the formulation of the therapeutic 

into a solid dosage form. Materials used as disintegrants include but are 
not limited to starch including the commercial disintegrantt>ased on 
starch, Explotab. Sodium starch glycolate, Amberlite, sodium 
carboxymethylcellulose, ultramylopectin, sodium alginate, gelatin, orange 
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peel, acid carboxymethyl cellulose, natural sponge and bentonite may all 
be used. Another form of the disintegrants are the insoluble canonic 
exchange resins. Powdered gums may be used as disintegrants and as 
binders and these can include powdered gums such as agar, Karaya or 
5 tragacahth. Alginic acid and its sodium salt are also useful as 
disintegrants. 

Binders may be used to hold the therapeutic agent together to form 
a hard tablet and include materials from natural products such as acacia, 
tragacanth, starch and gelatin. Others include methyl cellulose (MC), ethyl 

0 cellulose (EC) and carboxymethyl cellulose (CMC). Polyvinyl pyrrolidone 
(PVP) and hydroxypropylmethyl cellulose (HPMC) could both be used in 
alcoholic solutions to granulate the therapeutic. 

An antifrictional agent may be included in the formulation of the 
therapeutic to prevent sticking during the formulation process. Lubricants 

5 may be used as a layer between the therapeutic and the die wall, and these 
can include but are not limited to; stearic acid including its magnesium 
and calcium salts, polytetrafluoroethylene (PTFE), liquid paraffin, 
vegetable oils and waxes. Soluble lubricants may also be used such as 
sodium lauryl sulfate, magnesium lauryl sulfate, polyethylene glycol of 

0 various molecular weights, Carbowax 4000 and 6000. 

Glidants that might improve the flow properties of the drug during 
formulation and to aid rearrangement during compression might be 
added. The glidants may include starch, talc, pyrogenic silica and 
hydrated suicoaluminate. 

5 To aid dissolution of the compound of this invention into the 

aqueous environment a surfactant might be added as a wetting agent. 
Surfactants may include anionic detergents such as sodium lauryl sulfate, 
dioctyl sodium sulfosuccinate and dioctyl sodium sulfonate. Cationic 
detergents might be used and could include benzalkonium chloride or 

SO 
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benzethonium chloride. The list of potential nonionic detergents that 
could be included in the formulation as surfactants are lauromacrogol 400, 
polyoxyl 40 stearate, polyoxyethylene hydrogenated castor oil 10, 50 and 
60, glycerol monostearate, polysorbate 40, 60, 65 and 80, sucrose fatty acid 
5 ester, methyl cellulose and carboxymethyl cellulose. These surfactants 
could be present in the formulation of the protein or derivative either 
alone or as a mixture in different ratios. 

Additives may also be included in the formulation to enhance 
uptake of the compound. Additives potentially having this property are 
0 for instance the fatty acids oleic acid, linoleic acid and linolenic acid. 

Controlled release formulation may be desirable. The compound of 
this invention could be incorporated into an inert matrix which permits 
release by either diffusion or leaching mechanisms e.g., gums. Slowly 
degenerating matrices may also be incorporated into the formulation, e.g., 
5 alginates, polysaccharides. Another form of a controlled release of the 

compounds of this invention is by a method based on the Oros therapeutic 
system (Alza Corp.), i.e., the drug is enclosed in a semipermeable 
membrane which allows water to enter and push drug out through a 
single small opening due to osmotic effects. Some enteric coatings also 
0 have a delayed release effect. 

Other coatings may be used for the formulation. These include a 
variety of sugars which could be applied in a coating pan. The therapeutic 
agent could also be given in a film coated tablet and the materials used in 
this instance are divided into 2 groups. The first are the nonenteric 
5 materials and include methyl cellulose, ethyl cellulose, hydroxyethyl 
cellulose, methylhydroxy-ethyl cellulose, hydroxypropyl cellulose, 
hydroxypropyl-methyl cellulose, sodium carboxy-methyl cellulose, 
providone and the polyethylene glycols. The second group consists of the 
enteric materials that are commonly esters of phthalic acid. 
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A mix of materials might be used to provide the optimum film 
coating. Film coating may be carried out in a pan coater or in a fluidized 
bed or by compression coating. 

Pulmonary delivery forms . Also contemplated herein is pulmonary 
5 delivery of the present protein (or derivatives thereof). The protein (or 
derivative) is delivered to the lungs of a mammal while inhaling and 
traverses across the lung epithelial lining to the blood stream. (Other 
reports of this include Adjei etaL, Pharma. Res . (1990) 7: 565-9; Adjei etaL 
(1990), Internatl. T. Pharmaceutics 63: 135-44 (leuprolide acetate); Braquet 

1 0 etal. (1989), T. Cardiovasc. Pharmacol . 13 (suppl.5): s.143-146 (endothelin- 
1); Hubbard etal. (1989), Annals Int. Med . 3: 206-12 (al -antitrypsin); Smith 
etal. (1989), T. Clin. Invest . 84: 1145-6 (al-proteinase); Oswein etal. (March 
1990), "Aerosolization of Proteins", Proc. Svmp . Resp. Drug Delivery II, 
Keystone, Colorado (recombinant human growth hormone); Debs et al. 

1 5 (1988), T. Immunol . 140: 3482-8 (interf eron-y and tumor necrosis factor a) 
and Platz etal., U.S. Patent No. 5,284,656 (granulocyte colony stimulating 
factor). 

Contemplated for use in the practice of this invention are a wide 
range of mechanical devices designed for pulmonary delivery of 

2 0 therapeutic products, including but not limited to nebulizers, metered 
dose inhalers, and powder inhalers, all of which are familiar to those 
skilled in the art. Some specific examples of commercially available 
devices suitable for the practice of this invention are the Ultravent 
nebulizer, manufactured by Mallinckrodt, Inc., St. Louis, Missouri; the 

2 5 Acorn II nebulizer, manufactured by Marquest Medical Products, 

Englewood, Colorado; the Ventolin metered dose inhaler, manufactured 
by Glaxo Inc., Research Triangle Park, North Carolina; andlhe Spinhaler 
powder inhaler, manufactured by Fisons Corp., Bedford, Massachusetts. 
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All such devices require the use of formulations suitable for the 
dispensing of the inventive compound. Typically, each formulation is 
specific to the type of device employed and may involve the use of an 
appropriate propellant material, in addition to diluents, adjuvants 
5 and/or carriers useful in therapy. 

The inventive compound should most advantageously be 
prepared in particulate form with an average particle size of less than 10 
^im (or microns), most preferably 0.5 to 5 urn, for most effective delivery 
to the distal lung. 

1 o Pharmaceutical^ acceptable carriers include carbohydrates such 

as trehalose, mannitol, xylitol, sucrose, lactose, and sorbitol. Other 
ingredients for use in formulations may include DPPC, DOPE, DSPC and 
DOPC. Natural or synthetic surfactants may be used. PEG may be used 
(even apart from its use in derivatizing the protein or analog). Dextrans, 
1 5 such as cyclodextran, may be used. Bile salts and other related enhancers 
may be used. Cellulose and cellulose derivatives may be used. Amino 
acids may be used, such as use in a buffer formulation. 

Also, the use of liposomes, microcapsules or microspheres, 
inclusion complexes, or other types of carriers is contemplated. 

2 0 Formulations suitable for use with a nebulizer, either jet or 

ultrasonic, will typically comprise the inventive compound dissolved in 
water at a concentration of about 0.1 to 25 mg of biologically active protein 
per mL of solution. The formulation may also include a buffer and a 
simple sugar (e.g., for protein stabilization and regulation of osmotic 
2 5 pressure). The nebulizer formulation may also contain a surfactant, to 
reduce or prevent surface induced aggregation of the protein caused by 
atomization of the solution in forming the aerosol. 

Formulations for use with a metered-dose inhaler device will 
generally comprise a finely divided powder containing the inventive 
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compound suspended in a propellant with the aid of a surfactant. The 
propellant may be any conventional material employed for this purpose, 
such as a chlorofluorocarbon, a hydrochlorofluorocarbon, a 
hydrofluorocarbon, or a hydrocarbon, including rricMorofluoromethane, 
5 dichlorodifluoromethane, dichlorotetrafluoroethanol, and 1,1,1,2- 

tetrafluoroethane, or combinations thereof. Suitable surfactants include 
sorbitan trioleate and soya lecithin. Oleic acid may also be useful as a 
surfactant. 

Formulations for dispensing from a powder inhaler device will 

1 0 comprise a finely divided dry powder containing the inventive compound 
and may also include a bulking agent, such as lactose, sorbitol, sucrose, 
mannitol, trehalose, or xylitol in amounts which facilitate dispersal of the 
powder from the device, e.g., 50 to 90% by weight of the formulation. 

Nasal delivery forms . Nasal delivery of the inventive compound is 

1 5 also contemplated. Nasal delivery allows the passage of the protein to the 
blood stream directly after administering the therapeutic product to the 
nose, without the necessity for deposition of the product in the lung. 
Formulations for nasal delivery include those with dextran or 
cyclodextran. Delivery via transport across other mucous membranes is 

2 0 also contemplated. 

Dosages . The dosage regimen involved in a method for treating the 
above-described conditions will be determined by the attending physician, 
considering various factors which modify the action of drugs, e.g. the age, 
condition, body weight, sex and diet of the patient, the severity of any infection, 

2 5 time of administration and other clinical factors. Generally, the daily regimen 
should be in the range of 0.1-1000 micrograms of the inventive compound per 
kilogram of body weight, preferably 0.1-150 micrograms per kilogram. 



SH 
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Specific preferred embodiments 

The inventors have determined preferred peptide sequences for 
molecules having many different kinds of activity. The inventors have 
further determined preferred structures of these preferred peptides 



5 combined with preferred linkers and vehicles. Preferred structures 
these preferred peptides listed in Table 21 below. 

Table 21 — Preferred embodiments 



Sequence/structure 
c 1 /n\ ipr^PTi ROWl AARA-fG^ -1EGPTLRQWLAARA 


CT70 

ID 

NO: 

337 


/vcvi v ny 
TPO-mimetic 


1 EG PTLRQWLAAR A-(G)„-1 EG PTLRQWLAAR A-(Ci )«- f- 
F -(Ca) 5 -ltvar 1 LHUWLMMnM 


338 
1032 


TPO-mimetic 
TPO-mimetic 


IEGPTLRQWLAARA -(G) s - F 1 


1033 


TPO-mimetic 


F'-(G) 5 -GGTYSCHFGPLTWVCKPQGG-(G) 4 - 
GGTYSCHFGPLTWVCKPQGG 


339 


EPO-mimetic 


GGTYSCHFG PLTWVCKPQGG-(G) 4 - 
GGTYSCHFGPLTWVCKPQGG-(G),-F' 


340 


EPO-mimetic 


GGTYSCHFGPLTWVCKPQGG-(G) S -F' 


1034 


EPO-mimetic 


F'-tGJs-DFLPHYKNTSLGHRP 


1045 


TNF-a inhibitor 


DFLPHYKNTSLGHRP-(G) 5 -F' 


1046 


TNF-a inhibitor 


F'-(G) a - FEWTPGYWQPYALPL 


1047 


IL-1 R antagonist 


FEWTPGYWQPYALPL-(G) S -F' 


1048 


IL-1 R antagonist 


F'-(G) 5 -VEPNCDIHVMWEWECFERL 


1049 


VEGF-antagonist 


VEPNCDIHVMWEWECFERL-fG^-F 1 


1050 


VEGF-antagonist 


F'-(G) S -CTTHWGFTLC 


1051 


MMP inhibitor 


CTTHWGFTLC-(G) 5 -F' 


1052 


MMP inhibitor 



"F 1 " is an Fc domain as defined previously herein. 



Working examples 
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The compounds described above may be prepared as described 
below. These examples comprise preferred embodiments of the invention 
and are illustrative rather than limiting. 

Example 1 

5 TPQ-Mimetics 

The following example uses peptides identified by the numbers 
appearing in Table A hereinafter. 

Preparation of peptide 19 . Peptide 17b (12 mg) and MeO-PEG-SH 
5000 (30 mg, 2 equiv.) were dissolved in 1 ml aqueous buffer (pH 8). The 

0 mixture was incubated at RT for about 30 minutes and the reaction was 
checked by analytical HPLC, which showed a > 80% completion of the 
reaction. The pegylated material was isolated by preparative HPLC. 

Preparation of peptide 20 . Peptide 18 (14 mg) and MeO-PEG- 
maleimide (25 mg) were dissolved in about 1.5 ml aqueous buffer (pH 8). 

5 The mixture was incubated at RT for about 30 minutes, at which time 
about 70% transformation was complete as monitored with analytical 
HPLC by applying an aliquot of sample to the HPLC column. The 
pegylated material was purified by preparative HPLC. 

Bioactivitv assay . The TPO in vitro bioassay is a mitogenic assay 

0 utilizing an IL-3 dependent clone of murine 32D cells that have been 
transfected with human mpl receptor. This assay is described in greater 
detail in WO 95/26746. Cells are maintained in MEM medium containing 
10% Fetal Clone II and 1 ng/ml mIL-3. Prior to sample addition, cells are 
prepared by rinsing twice with growth medium lacking mIL-3. An 

5 extended twelve point TPO standard curve is prepared, ranging from 33 
to 39 pg/ml. Four dilutions, estimated to fall within the linear portion of 
the standard curve, (100 to 125 pg/ml), are prepared for each sample and 
run in triplicate. A volume of 100 ul of each dilution of sample or 
standard is added to appropriate wells of a 96 well microtiter plate 

St 
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containing 10,000 cells/well. After forty-four hours at 37 °C and 10% CO z , 
MTS (a tetrazolium compound which is bioreduced by cells to a formazan) 
is added to each well. Approximately six hours later, the optical density is 
read on a plate reader at 490 nm. A dose response curve (log TPO 
5 concentration vs. O.D.- Background) is generated and linear regression 
analysis of points which fall in the linear portion of the standard curve is 
performed. Concentrations of unknown test samples are determined 
using the resulting linear equation and a correction for the dilution factor. 
TMP tandem repeats with polvglycine linkers . Our design of 

1 0 sequentially linked TMP repeats was based on the assumption that a 

dimeric form of TMP was required for its effective interaction with c-Mpl 
(the TPO receptor) and that depending on how they were wound up 
against each other in the receptor context, the two TMP molecules could 
be tethered together in the C- to N-termimis configuration in a way that 

1 5 would not perturb the global dimeric conformation. Clearly, Ihe success 
of the design of tandem linked repeats depends on proper selection of the 
length and composition of the linker that joins the C- and N-terrnini of the 
two sequentially aligned TMP monomers. Since no structural information 
of the TMP bound to c-Mpl was available, a series of repeated peptides 

2 0 with linkers composed of 0 to 10 and 14 glycine residues (Table A) were 
synthesized. Glycine was chosen because of its simplicity and flexibility, 
based on the rationale that a flexible polyglycine peptide chain might 
allow for the free folding of the two tethered TMP repeats into the 
required conformation, while other amino acid sequences may adopt 

2 5 undesired secondary structures whose rigidity might disrupt the correct 
packing of the repeated peptide in the receptor context. 

The resulting peptides are readily accessible by conventional solid 
phase peptide synthesis methods (Merrifield (1963), ]. Amer. Chem. Soc. 
85: 2149) with either Fmoc or t-Boc chemistry. Unlike the synthesis of the 

g-7 
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C-terminally linked parallel dimer which required the use of an 
orthogonally protected lysine residue as the initial branch point to build 
the two peptide chains in a pseudosymmetrical way (Cwirla etaL (1997), 
Science 276: 1696-9), the synthesis of these tandem repeats was a 
5 straightforward, stepwise assembly of the continuous peptide chains from 
the C- to N-terminus. Since dimerization of TMP had a more dramatic 
effect on the proliferative activity than binding affinity as shown for the C- 
terminal dimer (Cwirla etaL (1997)), the synthetic peptides were tested 
directly for biological activity in a TPO-dependent cell-proliferation assay 

1 0 using an IL-3 dependent clone of murine 32D cells transfected with the 
full-length c-Mpl (Palacios etaL,. Cell 41:727 (1985)). As the test results 
showed, all the polyglycine linked tandem repeats demonstrated >1000 
fold increases in potency as compared to the monomer, and were even 
more potent than the C-terminal dimer in this cell proliferation assay. The 

1 5 absolute activity of the C-terminal dimer in our assay was lower than that 
of the native TPO protein, which is different from the previously reported 
findings in which the C-terminal dimer was found to be as active as the 
natural ligand (Cwirla etaL (1997)). This might be due to differences in 
the conditions used in the two assays. Nevertheless, the difference in 

2 0 activity between tandem (C terminal of first monomer linked to N 

terminal of second monomer) and C-terminal (C terminal of first monomer 
linked to C terminal of second monomer; also referred to as parallel) 
dimers in the same assay clearly demonstrated the superiority of tandem 
repeat strategy over parallel peptide dimerization. It is interesting to note 

2 5 that a wide range of length is tolerated by the linker. The optimal linker 
between tandem peptides with the selected TMP monomers apparently is 
composed of 8 glycines. " "~ 

Other tandem repeats . Subsequent to this first series of TMP 
tandem repeats, several other molecules were designed either with 
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different linkers or containing modifications within the monomer itself. 
The first of these molecules, peptide 13, has a linker composed of GPNG, a 
sequence known to have a high propensity to form a p-turn-type 
secondary structure. Although still about 100-fold more potent than the 
5 monomer, this peptide was found to be >10-fold less active than the 

equivalent GGGG-linked analog. Thus, introduction of a relatively rigid 
p-turn at the linker region seemed to have caused a slight distortion of the 
optimal agonist conformation in this short linker form. 

The Trp9 in the TMP sequence is a highly conserved residue among 

10 the active peptides isolated from random peptide libraries. There is also a 
highly conserved Trp in the consensus sequences of EPO mimetic peptides 
and this Trp residue was found to be involved in the formation of a 
hydrophobic core between the two EMPs and contributed to hydrophobic 
interactions with the EPO receptor. Livnah etal. (1996), Science 273: 464- 

15 71). By analogy, the Trp9 residue in TMP might have a similar function in 
dimerization of the peptide ligand, and as an attempt to modulate and 
estimate the effects of noncovalent hydrophobic forces exerted by the two 
indole rings, several analogs were made resulting from mutations at the 
Trp. So in peptide 14, the Trp residue was replaced in each of the two 

2 0 TMP monomers with a Cys, and an intramolecular disulfide bond was 
formed between the two cysteines by oxidation which was envisioned to 
mimic the hydrophobic interactions between the two Trp residues in 
peptide dimerization. Peptide 15 is the reduced form of peptide 14. In 
peptide 16, the two Trp residues were replaced by Ala. As the assay data 

2 5 show, all three analogs were inactive. These data further demonstrated 
that Trp is critical for the activity of the TPO mimetic peptide, not just for 
dimer formation. 

The next two peptides (peptide 17a, and 18) each contain in their 8- 
amino acid linker a Lys or Cys residue. These two compounds are 
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precursors to the two PEGylated peptides (peptide 19 and 20) in which the 
side chain of the Lys or Cys is modified by a PEG moiety. A PEG moiety 
was introduced at the middle of a relatively long linker, so that the large 
PEG component (5 kDa) is far enough away from the critical binding sites 

5 in the peptide molecule. PEG is a known biocompatible polymer which is 
increasingly used as a covalent modifier to improve the pharmacokinetic 
profiles of peptide- and protein-based therapeutics, 

A modular, solution-based method was devised for convenient 
PEGylation of synthetic or recombinant peptides. The method is based on 

0 the now well established chemoselective ligation strategy which utilizes 
the specific reaction between a pair of mutually reactive functionalities. 
So, for pegylated peptide 19, the lysine side chain was preactivated with a 
bromoacetyl group to give peptide 17b to accommodate reaction with a 
thiol-derivatized PEG. To do that, an orthogonal protecting group, Dde, 

5 was employed for the protection of the lysine s-amine. Once the whole 
peptide chain was assembled, the N-terminal amine was reprotected with 
t-Boc. Dde was then removed to allow for the bromoacetylation. This 
strategy gave a high quality crude peptide which was easily purified using 
conventional reverse phase HPLC. Ligation of the peptide with the thiol- 

0 modified PEG took place in aqueous buffer at pH 8 and the reaction 
completed within 30 minutes. MALDI-MS analysis of the purified, 
pegylated material revealed a characteristic, bell-shaped spectrum with an 
increment of 44 Da between the adjacent peaks. For PEG-peptide 20, a 
cysteine residue was placed in the linker region and its side chain thiol 

5 group would serve as an attachment site for a maleimide-containing PEG. 
Similar conditions were used for the pegylation of this peptide. As the 
assay data revealed, these two pegylated peptides had evefT higher in vitro 
bioactivity as compared to their unpegylated counterparts. 

fD 
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Peptide 21 has in its 8-amino acid linker a potential glycosylation 
motif, NGS. Since our exemplary tandem repeats are made up of natural 
amino acids linked by peptide bonds, expression of such a molecule in an 
appropriate eukaryotic cell system should produce a glycopeptide with 
5 the carbohydrate moiety added on the side chain carboxyamide of Asn. 
Glycosylation is a common post-translational modification process which 
can have many positive impacts on the biological activity of a given 
protein by increasing its aqueous solubility and in vivo stability. As the 
assay data show, incorporation of this glycosylation motif into the linker 

1 0 maintained high bioactivity. The synthetic precursor of the potential 
glycopeptide had in effect an activity comparable to that of the -(G) 8 - 
linked analog. Once glycosylated, this peptide is expected to have the 
same order of activity as the pegylated peptides, because of the similar 
chemophysical properties exhibited by a PEG and a carbohydrate moiety. 

15 The last peptide is a dimer of a tandem repeat. It was prepared by 

oxidizing peptide 18, which formed an intermolecular disulfide bond 
between the two cysteine residues located at the linker. This peptide was 
designed to address the possibility that TMP was active as a tetramer. The 
assay data showed that this peptide was not more active than an average 

2 0 tandem repeat on an adjusted molar basis, which indirectly supports the 
idea that the active form of TMP is indeed a dimer, otherwise dimerization 
of a tandem repeat would have a further impact on the bioactivity. 

In order to confirm the in vitro data in animals, one pegylated TMP 
tandem repeat (compound 20 in Table A) was delivered subcutaneously to 

2 5 normal mice via osmotic pumps. Time and dose-dependent increases 

were seen in platelet numbers for the duration of treatment. Peak platelet 
levels over 4-fold baseline were seen on day 8. A dose of 10"ug/kg/ day of 
the pegylated TMP repeat produced a similar response to rHuMGDF 
(non-pegylated) at 100 ug/kg/day delivered by the same route. 
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Table A— TPO-mimetic Peptides 



Peptide Compound SEQ ID Relative 

N NO: Potency 





TPO 




++++ 




TMP monomer 


13 


+ 




TMP C-C dimer 




+++- 


IP-(G) n -TMP: 






1 


n = 0 


341 


++++- 


2 


n = 1 


342 


++++ 


3 


n = 2 


343 


++++ 


4 


n = 3 


344 


++++ 


5 


n = 4 


345 


++++ 


6 


n = 5 


346 


++++ 


7 


n = 6 


347 


++++ 


8 


n = 7 


348 


++++ 


9 


n = 8 


349 


++_ 


10 


n = 9 


350 


+-M-+ 


11 


n = 10 


qc-i 
OO I 


++++ 




n = 14 


352 


++++ 


13 


TMP-GPNG-TMP 


353 


+++ 


14 


IEGPTLRQ£LAARA-GGGGGGGG-IEGPTLRQ£LAARA 
1 1 


354 






(cyclic) 


355 




15 


IEGPTLRQCLAARA-GGGGGGGG- 


- 




IEGPTLRQC.LAARA (linear) 






16 


IEGPTLRQALAARA-GGGGGGGG- 


356 






IEGPTLRQA.LAARA 






17a 


TMP-GGGKGGGG-TMP 


357 


++++ 


17b 


TMP-GGGK(BrAc)GGGG-TMP 


358 


ND 


18 


TMP-GGGCGGGG-TMP 


359 


++++ 


19 


TMP-GGGK(PEG)GGGG-TMP 


360 


+++++ 


20 


TMP-GGGC(PEG)GGGG-TMP 


361 


+++++ 


21 


TMP-GGGN*GSGG-TMP 


362 


++++ 


22 


TMP-GGGCGGGG-TMP 


363- 


++++ 




1 

TMP-GGGCGGGG-TMP 


363 
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Discussion . It is well accepted that MGDF acts in a way similar to 
hGH, i.e., one molecule of the protein ligand binds two molecules of the 
receptor for its activation. Wells §131.(1996), Ann. Rev. Biochem. 65: 609- 
34. Now, this interaction is mimicked by the action of a much smaller 
5 peptide, TMP. However, the present studies suggest that this mimicry 
requires the concerted action of two TMP molecules, as covalent 
dimerization of TMP in either a C-C parallel or C-N sequential fashion 
increased the in vitro biological potency of the original monomer by a 
factor of greater than 10 s . The relatively low biopotency of the monomer is 
1 0 probably due to inefficient formation of the noncovalent dimer. A 

preformed covalent repeat has the ability to eliminate the entropy barrier 
for the formation of a noncovalent dimer which is exclusively driven by 
weak, noncovalent interactions between two molecules of the small, 14- 
residue peptide. 

15 it is intriguing that this tandem repeat approach had a similar effect 

on enhancing bioactivity as the reported C-C dimerization is intriguing. 
These two strategies brought about two very different molecular 
configurations. The C-C dimer is a quasi-symmetrical molecule, while the 
tandem repeats have no such symmetry in their linear structures. Despite 

2 0 this difference in their primary structures, these two types of molecules 
appeared able to fold effectively into a similar biologically active 
conformation and cause the dimerization and activation of c-Mpl. These 
experimental observations provide a number of insights into how the two 
TMP molecules may interact with one another in binding to c-Mpl. First, 

25 the two C-termini of the two bound TMP molecules must be in relatively 
close proximity with each other, as suggested by data on the C-terminal 
dimer. Second, the respective N- and C-termini of the two TMP molecules - 
in the receptor complex must also be very closely aligned with each other, 
such that they can be directly tethered together with a single peptide bond 
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to realize the near maximum activity-enhancing effect brought about by 
the tandem repeat strategy. Insertion of one or more (up to 14) glycine 
residues at the junction did not increase (or decrease) significantly the 
activity any further. This may be due to the fact that a flexible polyglycine 
5 peptide chain is able to loop out easily from the junction without causing 
any significant changes in the overall conformation. This flexibility seems 
to provide the freedom of orientation for the TMP peptide chains to fold 
into the required conformation in interacting with the receptor and 
validate it as a site of modification. Indirect evidence supporting this 

1 0 came from the study on peptide 13, in which a much more rigid b-turn- 
forming sequence as the linker apparently forced a deviation of the 
backbone alignment around the linker which might have resulted in a 
slight distortion of the optimal conformation, thus resulting in a moderate 
(10-fold) decrease in activity as compared with the analogous compound 

1 5 with a 4-Gly linker. Third, Trp9 in TMP plays a similar role as Trpl3 in 
EMP, which is involved not only in peptide:peptide interaction for the 
formation of dimers but also is important for contributing hydrophobic 
forces in peptidetreceptor interaction. Results obtained with the W to C 
mutant analog, peptide 14, suggest that a covalent disulfide linkage is not 

2 0 sufficient to approximate the hydrophobic interactions provided by the 
Trp pair and that, being a short linkage, it might bring the two TMP 
monomers too close, therefore perturbing the overall conformation of the 
optimal dimeric structure. 

An analysis of the possible secondary structure of the TMP peptide 

2 5 can provide further understanding on the interaction between TMP and c- 
Mpl. This can be facilitated by making reference to the reported structure 
of the EPO mimetic peptide. Livnah etal. (1996), Science 273:464-75 The 
receptor-bound EMP has a b-hairpin structure with a b-turn formed by the 
highly consensus Gly-Pro-Leu-Thr at the center of its sequence. Instead of 

*7 
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GPLT, TMP has a highly selected GPTL sequence which is likely to form a 
similar turn. However, this turn-like motif is located near the N-terminal 
part in TMP- Secondary structure prediction using Chau-Fasman method 
suggests that the C-terminal half of the peptide has a tendency to adopt a 

5 helical conformation. Together with the highly conserved Tip at position 
9, this C-terminal helix may contribute to the stabilization of the dimeric 
structure. It is interesting to note that most of our tandem repeats are 
more potent than the C-terminal parallel dimer. Tandem repeats seem to 
give the molecule a better fit conformation than does the C-C parallel 

0 dimerization. The seemingly asymmetric feature of a tandem repeat 

might have brought it closer to the natural ligand which, as an asymmetric 
molecule, uses two different sites to bind two identical receptor molecules. 

Introduction of a PEG moiety was envisaged to enhance the in vivo 
activity of the modified peptide by providing it a protection against 

5 proteolytic degradation and by slowing down its clearance through renal 
filtration. It was unexpected that pegylation could further increase the in 
vitro bioactivity of a tandem repeated TMP peptide in the cell-based 
proliferation assay. 

Example 2 

0 FoTMP fusions 

TMPs (and EMPs as described in Example 3) were expressed in 
either monomeric or dimeric form as either N-terminal or C-terminal 
fusions to the Fc region of human IgGl. In all cases, the expression 
construct utilized the luxPR promoter promoter in the plasmid expression 

5 vector p AMG21 . 

FoTMP. A DNA sequence coding for the Fc region of human IgGl 
fused in-frame to a monomer of the TPO-mimetic peptide was constructed 
using standard PCR technology. Templates for PCR reactions were the 
pFc-A3 vector and a synthetic TMP gene. The synthetic gene was 
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10 



constructed from the 3 overlapping oligonucleotides (SEQ ID NOS: 364, 
365, and 366, respectively) shown below: 

1842-97 AAA AAA GGA TCC TCG AGA TTA AGC ACG AGC AGC CAG CCA 

CTG ACG CAG AGT CGG ACC 

1842-9 8 AAA GGT GGA GGT GGT GGT ATC GAA GGT CCG ACT CTG CGT 

1842-99 CAG TGG CTG GCT GCT CGT GCT TAA TCT CGA GGA TCC TTT 

TTT 

These oligonucleotides were annealed to form the duplex encoding an 
amino acid sequence (SEQ ID NOS: 367 and 368, respectively) shown 
below: 

15 AAAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCT 

x + + + + + + 60 

CCAGGCTGAGACGCAGTCACCGACCGACGAGCACGA 
a KGGGGGIEGPTLRQWLAARA 

2 0 TAATCTCGAGGATCCTTTTTT 

61 + + " 81 

ATTAGAGCTCCTAGGAAAAAA 

a * 

2 5 This duplex was amplified in a PCR reaction using 1842-98 and 1842-97 as 

the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers shown below (SEQ ID NOS: 369 and 370): 

30 1216-52 AAC ATA AGT ACC TGT AGG ATC G 

1830-51 TTC GAT ACC A CCACCTCCAC CTTTACCCGG AGACAGGGAG AGGCTCTTCTGC 

The oligonucleotides 1830-51 and 1842-98 contain an overlap of 24 

3 5 nucleotides, allowing the two genes to be fused together in the correct 

reading frame by combining the above PCR products in a third reaction 
using the outside primers, 1216-52 and 1842-97. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and BamHI, and then ligated 

4 0 into the vector pAMG21 and transformed into competent E. coli strain 

2596 cells as described for EMP-Fc herein. Clones were screened for the 
ability to produce the recombinant protein product and to possess the 

% 
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gene fusion having the correct nucleotide sequence. A single such clone 
was selected and designated Amgen strain #3728. 

The nucleotide and amino acid sequences (SEQ ID NOS: 5 and 6) of 
the fusion protein are shown in Figure 7. 
5 Fc-TMP-TMP . A DNA sequence coding for the Fc region of human 

IgGl fused in-frame to a dimer of the TPO-mimetic peptide was 
constructed using standard PCR technology. Templates for PCR reactions 
were the pFc-A3 vector and a synthetic TMP-TMP gene. The synthetic 
gene was constructed from the 4 overlapping oligonucleotides (SEQ ID 
1 0 NOS: 371 to 374, respectively) shown below: 

1830-52 AAA GGT GGA GGT GGT GGT ATC GAA GGT CCG 

ACT CTG CGT CAG TGG CTG GCT GCT CGT GCT 

15 1830-53 ACC TCC ACC ACC AGC ACG AGC AGC CAG 

CCA CTG ACG CAG AGT CGG ACC 

1830-54 GGT GGT GGA GGT GGC GGC GGA GGT ATT GAG GGC CCA ACC 

CTT CGC CAA TGG CTT GCA GCA CGC GCA 

20 

1830-55 AAA AAA AGG ATC CTC GAG ATT ATG CGC GTG CTG CAA GCC 

ATT GGC GAA GGG TTG GGC CCT CAA TAC CTC CGC CGC C 

The 4 oligonucleotides were annealed to form the duplex encoding an 
2 5 amino acid sequence (SEQ ID NOS: 375 and 376, respectively) shown 
below: 

AAAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCT 

1 + + + + + + 60 

o q CCAGGCTGAGACGCAGTCACCGACCGACGAGCACGA 
a KGGGGGIEGPTLRQWLAARA 

GGTGGTGGAGGTGGCGGCGGAGGTATTGAGGGCCCAACCCTTCGCCAATGGCTTGCAGCA 

+ + + + + + 120 

CCACCACCTCCACCGCCGCCTCCATAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGT 

GGGGGGGGIEGPTLRQWLAA 



61 

35 



a 

CGC GCA 

121 " 148 

4 0 GCGCGTATTAGAGCTCCTAGGAAAAAAA 

a R A * - 



This duplex was amplified in a PCR reaction using 1830-52 and 1830-55 as 
4 5 the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers 1216-52 and 1830-51 as described above for 

*7 
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Fc-TMP. The full length fusion gene was obtained from a third PCR 
reaction using the outside primers 1216-52 and 1830-55. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and BamHI, and then ligated 
5 into the vector p AMG21 and transformed into competent E. coli strain 
2596 cells as described in example 1. Clones were screened for the ability 
to produce the recombinant protein product and to possess the gene 
fusion having the correct nucleotide sequence. A single such clone was 
selected and designated Amgen strain #3727. 

10 The nucleotide and amino acid sequences (SEQ ID NOS: 7 and 8) of 

the fusion protein are shown in Figure 8. 

TMP-TMP-Fc . A DN A sequence coding for a tandem repeat of the 
TPO-mimetic peptide fused in-frame to the Fc region of human IgGl was 
constructed using standard PCR technology. Templates for PCR reactions 

15 were the EMP-Fc plasmid from strain #3688 (see Example 3) and a 
synthetic gene encoding the TMP dimer. The synthetic gene for the 
tandem repeat was constructed from the 7 overlapping oligonucleotides 
shown below (SEQ ID NOS: 377 to 383, respectively): 

20 1885-52 TTT TTT CAT ATG ATC GAA GGT CCG ACT CTG CGT CAG TGG 

1885-53 AGC ACG AGC AGC CAG CCA CTG ACG CAG AGT CGG ACC TTC 

GAT CAT ATG 

25 1885-54 CTG GCT GCT CGT GCT GGT GGA GGC GGT GGG GAG AAA ACT 

CAC ACA 



30 



1885-55 CTG GCT GCT CGT GCT GGC GGT GGT GGC GGA GGG GGT GGC 

ATT GAG GGC CCA 

1885-56 AAG CCA TTG GCG AAG GGT TGG GCC CTC AAT GCC ACC CCC 

TCC GCC ACC ACC GCC 

1885-57 ACC CTT CGC CAA TGG CTT GCA GCA CGC GCA GGG GGA GGC 

3 5 GGT GGG GAC AAA ACT 

1885-58 CCC ACC GCC TCC CCC TGC GCG TGC TGC 

These oligonucleotides were annealed to form the duplex shown encoding 
40 an amino acid sequence shown below (SEQ ID NOS 384 and 385): 
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TTTTTTCATATGATCGAAGGTCCGACTCTGCGTC AGTGGCTGGCTGCTCGTGCTGGCGGT 

1 + + + + + + o0 

GTATACTAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGACCGCCA 
a MIEGPTLRQWLAARAGG- 

5 GGTGGCGGAGGGGGTGGCATTGAGGGCCCAACCCTTCGCCAATGGCTGGCTGCTCGTGCT 

61 -.- + + + + + ----+ 120 

CCACCGCCTCCCCCACCGTAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGTGCGCGT 
a G G G G G G I E G P T L R Q W . L . A A R A 

0 GGTGGAGGCGGTGGGGACAAAACTCTGGCTGCTCGTGCTGGTGGAGGCGrGTGGGGACAAA 

121 :- + + + + + + 

CCCCCTCCGCCACCC 

a ggGGGDKTLAARAGGGGGDK - 

5 

ACTCACACA 
181 189 

a T H T - 

This duplex was amplified in a PCR reaction using 1885-52 and 1885-58 as 

the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 

with DNA from the EMP-Fc fusion strain #3688 (see Example 3) using the 
5 primers 1885-54 and 1200-54. The full length fusion gene was obtained 

from a third PCR reaction using the outside primers 1885-52 and 1200-54. 
The final PCR gene product (the full length fusion gene) was 

digested with restriction endonucleases Xbal and BamHL and then ligated 

into the vector pAMG21 and transformed into competent E. coli strain 
0 2596 cells as described for Fc-EMP herein. Clones were screened for the 

ability to produce the recombinant protein product and to possess the 

gene fusion having the correct nucleotide sequence. A single such clone 

was selected and designated Amgen strain #3798. 

The nucelotide and amino acid sequences (SEQ ID NOS: 9 and 10) 
5 of the fusion protein are shown in Figure 9. 

TMP-Fc . A DNA sequence coding for a monomer of the TPO- 

mimetic peptide fused in-frame to the Fc region of human IgGl was 

obtained fortuitously in the ligation in TMP-TMP-Fc, presumably due to 

the ability of primer 1885-54 to anneal to 1885-53 as well as to 1885-58. A 
0 single clone having the correct nucleotide sequence for the TMP-Fc 

construct was selected and designated Amgen strain #3788. 

1T 
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The nucleotide and amino acid sequences (SEQ ID NOS: 11 and 12) 
of the fusion protein are shown in Figure 10. 

Expression in E. coli . Cultures of each of the pAMG21-Fc-fusion 
constructs in E. coli GM221 were grown at 37 °C in Luria Broth medium 

5 containing 50 mg/ml kanamycin. Induction of gene product expression 
from the luxPR promoter was achieved following the addition of the 
synthetic autoinducer N-(3oxohexanoyl)-DL-homoserine lactone to the 
culture media to a final concentration of 20 ng/ml. Cultures were 
incubated at 37 °C for a further 3 hours. After 3 hours, the bacterial 

0 cultures were examined by microscopy for the presence of inclusion 
bodies and were then collected by centrifugation. Refractile inclusion 
bodies were observed in induced cultures indicating that the Fc-fusions 
were most likely produced in the insoluble fraction in E. coli . Cell pellets 
were lysed directly by resuspension in Laemmli sample buffer containing 

5 10% b-mercaptoethanol and were analyzed by SDS-PAGE. In each case, an 
intense coomassie-stained band of the appropriate molecular weight was 
observed on an SDS-PAGE gel. 

pAMG21 . The expression plasmid p AMG21 can be derived from 
the Amgen expression vector pCFM1656 (ATCC #69576) which in turn be 

0 derived from the Amgen expression vector system described in US Patent 
No. 4,710,473. The pCFM1656 plasmid can be derived from the described 
pCFM836 plasmid (Patent No. 4,710,473) by: 

(a) destroying the two endogenous Ndel restriction sites by end 
filling with T4 polymerase enzyme followed by blunt end 

5 ligation; 

(b) replacing the DNA sequence between the unique AatH and Clal 
restriction sites containing the synthetic Pl promoter with a 
similar fragment obtained from pCFM636 (patent No. 4,710,473) 
containing the PL promoter (see SEQ ID NO: 386 below); and 
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(c) substituting the small DNA sequence between the unique Clai 
and Kpnl restriction sites with the oligonucleotide having the 
sequence of SEQ ID NO: 388. 
SEQIDNO:386: 

5 Aatll 

5 ' CTAATTCCGCTCTCACCTACCAAACAATGCCCCCCTGCAAAAAATAAATTCATAT - 

3 ' TGC AGATTAAGGCGAGAGTGGATGGTTTGTTACGGGGGGACGTTTTTTATTTAAGTATA - 

- AAAAAACATACAGATAACC ATCTGCGGTGATAAATTATCTCTGGCGGTGTTGACATAAA - 
1 0 - TTTTTTGTATGTCTATTGGTAGACGCCACTATTTAATAGAGACCGCCACAACTGTATTT - 

- TACCACTGGCGGTGATACTGAGCACAT 3 ' 

- ATGGTGACCGCCACTATGACTCGTGTAGC 5 ' 

Clai 



15 



20 



SEQ ID NO: 387: 

5 ' CGATTTGATTCTAGAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGGTAC 3 ' 
3 ' TAAACTAAGATCTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGC 5 ' 

Clai ^EI! 1 



The expression plasmid pAMG21 can then be derived from pCFM1656 by 
making a series of site-directed base changes by PGR overlapping oligo 
mutagenesis and DNA sequence substitutions. Starting with the Bglll site 
(plasmid bp # 180) immediately 5' to the plasmid replication promoter 
2 5 PcopB and proceeding toward the plasmid replication genes, the base pair 
changes are as shown in Table B below. 



to) 
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Table B— Base pair changes resulting in pAMG21 

pAMG21 bp # hp in pOFM1656 hp plunged to in pAMG£1 

5 # 204 T/A C/G 

# 428 A/T G/C 

# 509 G/C A/T 
#617 . . insert two G/C bp 

# 679 G/C T/A 
10 # 980 T/A C/G 

# 994 G/C A/T 
#1004 A/T C/G 
#1007 C/G T/A 
#1028 A/T T/A 

15 #1047 C/G T/A 

#1178 G/C T/A 

#1466 G/C T/A 

# 2028 G/C bp deletion 
#2187 C/G T/A 

20 # 2480 A/T T/A 

# 2499-2502 A GTG SICA 

TCAC CAGT 

25 #2642 TCCGAGC 7 bp deletion 

AGGCTCG 

#3435 G/C A/T 

#3446 G/C A/T 

30 # 3643 A/T T/A 



/0> 
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The DNA sequence between the unique Aatll (position #4364 in 
pCFM1656) and SacH (position #4585 in pCFM1656) restriction sites is 
substituted with the DNA sequence (SEQ ID NO: 23) shown in Figures 
17A and 17B. During the ligation of the sticky ends of this substitution 
5 DNA sequence, the outside Aat n and Sadl sites are destroyed. There are 
unique Aatl l and SacH sites in the substituted DNA. 

(A m^en #2596 ). The Amgen host strain #2596 is an E.coli K- 
12 strain derived from Amgen strain #393. It has been modified to contain 
both the temperature sensitive lambda repressor cI857s7 in the early ebg 

1 0 region and the lacl° repressor in the late ebg region (68 minutes). The 
presence of these two repressor genes allows the use of this host with a 
variety of expression systems, however both of these repressors are 
irrelevant to the expression from luxP R . The untransformed host has no 
antibiotic resistances. 

1 5 The ribosome binding site of the cI857s7 gene has been modified to 

include an enhanced RBS. It has been inserted into the ebg operon 
between nucleotide position 1170 and 1411 as numbered in Genbank 
accession number M64441Gb_Ba with deletion of the intervening ebg 
sequence. The sequence of the insert is shown below with lower case 

2 0 letters representing the ebg sequences flanking the insert shown below 
(SEQ ID NO: 388): 

ttattttcatGCGGCCGCACCATTATCACCGCCAGAGGTAAACTAGTCAACACGCACGGTGTTAGATATTTAT 
CCCTTGCGGTGATAGATTGAGCACATCGATTTGATTCTAGAAGGAGGGATAATATATGAGCACAAAAAAGAAA 
CCATTAACACAAGAGCAGCTTGAGGACGCACGTCGGCTTAAAGCAATTTATGAAAAAAAGAAAAATGAACTTG 

2 5 gc^atcccaImaatctc^^ 

caatg^Xtta^tccttataac^ 

l^TCGCCAGAGAATCTACGAGATGTATGAAGCGGTTAGTATGCAGCCGTCACTTAGAAGTGAGTATGAGTA 

CCC^GTTOTTTCTCATGTTCAGGCAGGGATGTTCTCAC 

AGATGGGTAAGCACAACCAAAAAAGCCAGTGATTC 

3 0 caccaacAgII^ 

AGGTGATTTCTGCATAGCCAGW 

GTGTTTTTACAACCACTAAACCCACAGTACCCAATGATC 

TTATCGCTAGTCAGTGGCCTGAAGAGACGTTTGGCTGATAGACTAGTGGATCCACTAGTg 1 1 tc tgCCC 

3 5 The construct was delivered to the chromosome using a 

recombinant phage called MMebg-cI857s7enhanced RBS #4 into F'tet/393. 
After recombination and resolution only the chromosomal insert described 

103, 
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above remains in the cell. It was renamed Ftet/GMlOl. F'tet/GMlOl was 
then modified by the delivery of a lacl° construct into the ebg operon 
between nucleotide position 2493 and 2937 as numbered in the Genbank 
accession number M64441GbJBa with the deletion of the intervening ebg 
5 sequence. The sequence of the insert is shown below with the lower case 
letters representing the ebg sequences flanking the insert (SEQ ID NO: 
389) shown below: 

ggcggaaaccGACGTCCATCGAATGGTGCAAAACCTTTCGCGGTATGGCA 

ATTCAGGGTGGTGAATGTGAAACCAGTAACGTTATACGATGTCGCAGAGTATGCCGGTGTCTCTTATCAGACC 
10 GTTTCCCGCGTGGTGAACCAGGCCAGCCACGTTTCTGCGAAAACGCGGGAAAAAGTCGAAGCGGCGATGGCGG 
AGCTGAATTACATTCCCAACCGCGTGGCACAACAACTGGCGGGCAAACAGTCGCTCCTGATTGGCGTTGCCAC 
CTCCAGfCTGGCCCTGCACGCGCCGTCGCAAATTGTCGCGGCGATTAAATCTCGCGCCGATCAACTGGGTGCC 
AGCGTGGTGGTGTCGATGGTAGAACGAAGCGGCGTCGAAGCCTGTAAAGCGGCGGTGCACAATCTTCTCGCGC 
AACGCGTCAGTGGGCTGATCATTAACTATCCGCTGGATGACCAGGATGCCATTGCTGTGGAAGCTGCCTGCAC 
15 TAATGTTCCGGCGTTATTTCTTGATGTCTCTGACCAGACACCCATCAACAGTATTATTTTCTCCCATGAAGAC 
GGTACGCGACTGGGCGTGGAGCATCTGGTCGCATTGGGTCACCAGCAAATCGCGCTGTTAGCGGGCCCATTAA 
GTTCTGTCTCGGCGCGTCTGCGTCTGGCTGGCTGGCATAAATATCTCACTCGCAATCAAATTCAGCCGATAGC 
GGAACGGGAAGGCGACTGGAGTGCCATGTCCGGTTTTCAACAAACCATGCAAATGCTGAATGAGGGCATCGTT 
CCCACTGCGATGCTGGTTGCCAACGATCAGATGGCGCTGGGCGCAATGCGCGCCATTACCGAGTCCGGGCTGC 
20 GCGTTGGTGCGGATATCTCGGTAGTGGGATACGACGATACCGAAGACAGCTCATGTTATATCCCGCCGTTAAC 
CACCATCAAACAGGATTTTCGCCTGCTGGGGCAAACCAGCGTGGACCGCTTGCTGCAACTCTCTCAGGGCCAG 
GCGGTGAAGGGCAATCAGCTGTTGCCCGTCTCACTGGTGAAAAGAAAAACCACCCTGGCGCCCAATACGCAAA 
CCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGACA 
GTAAGGTACCATAGGATCCaggcacagga 

25 

The construct was delivered to the chromosome using a 
recombinant phage called AGebg-LacIQ#5 into Ftet/GMlOl. After 
recombination and resolution only the chromosomal insert described 
above remains in the cell It was renamed F'tet/GM221. The F'tet episome 
3 0 was cured from the strain using acridine orange at a concentration of 25 
Kig/ml in LB. The cured strain was identified as tetracyline sensitive and 
was stored as GM221. 

Expression . Cultures of pAMG21-Fc-TMP-TMP in E. coli GM221 in 

3 5 Luria Broth medium containing 50 ng/ml kanamycin were incubated at 

37°C prior to induction. Induction of Fc-TMP-TMP gene product 
expression from the luxPR promoter was achieved following the addition 
of the synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to 
the culture media to a final concentration of 20 ng/ml and cultures were 

4 0 incubated at 37°C for a further 3 hours. After 3 hours, the bacterial 
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cultures were examined by microscopy for the presence of inclusion 
bodies and were then collected by centrifugation. Refractile inclusion 
bodies were observed in induced cultures indicating that the Fc-TMP-TMP 
was.most likely produced in the insoluble fraction in E. coli . Cell pellets 

5 were lysed directly by resuspension in Laemmli sample buffer containing 
10% •-mercaptoethanol and were analyzed by SDS-PAGE. An intense 
Coomassie stained band of approximately 30kDa was observed on an 
SDS-PAGE gel. The expected gene product would be 269 amino acids in 
length and have an expected molecular weight of about 29.5 kDa. 

0 Fermentation was also carried out under standard batch conditions at the 
10 L scale, resulting in similar expression levels of the Fc-TMP-TMP to 
those obtained at bench scale. 

Purification of Fc-TMP-TMP . Cells are broken in water (1/10) by 
high pressure homogenization (2 passes at 14,000 PSI) and inclusion 

5 bodies are harvested by centrifugation (4200 RPM in J-6B for 1 hour). 
Inclusion bodies are solubilized in 6M guanidine, 50mM Tris, 8mM DTT, 
pH 8.7 for 1 hour at a 1 /10 ratio. The solubilized mixture is diluted 20 
times into 2M urea, 50 mM tris, 160mM arginine, 3mM cysteine, pH 8.5. 
The mixture is stirred overnight in the cold and then concentrated about 

0 10 fold by ultafilrration. It is then diluted 3 fold with lOmM Tris, 1.5M 
urea, pH 9. The pH of this mixture is then adjusted to pH 5 with acetic 
acid. The precipitate is removed by centrifugation and the supernatant is 
loaded onto a SP-Sepharose Fast Flow column equilibrated in 20mM 
NaAc, 100 mM NaCl, pH 5(10mg/ml protein load, room temperature). 

5 The protein is eluted off using a 20 column volume gradient in the same 
buffer ranging from lOOmM NaCl to 500mM NaCl. The pool from the 
column is diluted 3 fold and loaded onto a SP-Sepharose HP column in 20 
mM NaAc, 150 mM NaCl, pH 5(10 mg/ml protein load, room 
temperature). The protein is eluted off using a 20 column volume gradient 

IK 
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in the same buffer ranging from 150 mM NaCl to 400 mM NaCl. The peak 

is pooled and filtered. 

Characterization of Fc-TMP activity . The following is a summary of 

in vivo data in mice with various compounds of this invention. 
5 Mice: Normal female BDF1 approximately 10-12 weeks of age. 

Bleed schedule: Ten mice per group treated on day 0, two groups 

started 4 days apart for a total of 20 mice per group. Five mice bled at each 

time point, mice were bled a minimum of three times a week. Mice were 

anesthetized with isoflurane and a total volume of 140-160 ul of blood was 
0 obtained by puncture of the orbital sinus. Blood was counted on a 

Technicon HIE blood analyzer running software for murine blood. 

Parameters measured were white blood cells, red blood cells, hematocrit, 

hemoglobin, platelets, neutrophils. 

Treatments: Mice were either injected subcutaneously for a bolus 
5 treatment or implanted with 7-day micro-osmotic pumps for continuous 

delivery. Subcutaneous injections were delivered in a volume of 0.2 ml. 

Osmotic pumps were inserted into a subcutaneous incision made in the 

skin between the scapulae of anesthetized mice. Compounds were diluted 

in PBS with 0.1% BSA. All experiments included one control group, 
0 labeled "carrier" that were treated with this diluent only. The 

concentration of the test articles in the pumps was adjusted so that the 

calibrated flow rate from the pumps gave the treatment levels indicated in 

the graphs. 

Compounds: A dose titration of the compound was delivered to 
5 mice in 7 day micro-osmotic pumps. Mice were treated with various 
compounds at a single dose of 100 ug/kg in 7 day osmotic pumps. Some 
of the same compounds were then given to mice as a siriglebolus injection. 

Activity test results: The results of the activity experiments are 
shown in Figures 11 and 12. In dose response assays using 7-day micro- 
ti, 
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osmotic pumps, the maximum effect was seen with the compound of SEQ 
ID NO: 18 was at 100 ug/kg/day; the 10 ug/kg/day dose was about 50% 
maximally active and 1 ug/kg/day was the lowest dose at which activity 
could be seen in this assay system. The compound at 10 ug/kg/day dose 
5 was about equally active as 100 ug/kg/day unpegylated rHu-MGDF in 
the same experiment. 
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Example 3 
Fc-EMP fusions 

FoEMP . A DNA sequence coding for the Fc region of human IgGl 
fused in-frame to a monomer of the EPO-mimetic peptide was constructed 
5 using standard PCR technology. Templates for PCR reactions were a 
vector containing the Fc sequence (pFc-A3, described in International 
application WO 97/23614, published July 3, 1997) and a synthetic gene 
encoding EPO monomer. The synthetic gene for the monomer was 
constructed from the 4 overlapping oligonucleotides (SEQ ID NOS: 390 to 
1 0 393, respectively) shown below: 

1798-2 TAT GAA AGG TGG AGG TGG TGG TGG AGG TAC TTA CTC TTG 
CCA CTT CGG CCC GCT GAC TTG G 

15 1798-3 CGG TTT GCA AAC CCA AGT CAG CGG GCC GAA GTG GCA AGA 
GTA AGT ACC TCC ACC ACC ACC TCC ACC TTT CAT 



20 



45 



1798-4 GTT TGC AAA CCG CAG GGT GGC GGC GGC GGC GGC GGT GGT 
ACC TAT TCC TGT CAT TTT 

1798-5 CCA GGT CAG CGG GCC AAA ATG AC A GGA ATA GGT ACC ACC 
GCC GCC GCC GCC GCC ACC CTG 



The 4 oligonucleotides were annealed to form the duplex encoding an 
2 5 amino acid sequence (SEQ ID NOS: 394 and 395, respectively) shown 
below: 



TATGAAAGGTGGAGGTGGTGGTGGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTG 



1 



+ 60 



3 0 TACTTTCCACCTCCACCACCACCTCCATGAATGAGAACGGTGAAGCCGGGCGACTGAAC 
b MK GGGGGGGTYSCHFGPLTW 

GGTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTT 
gj. -------*- + -- -- -*--- + -- *- -- -- - + "~*--"-"-^--*****"-^-""""""""'"'"^" — - -- -- -- - + — 133 

35 CCAAACGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCGACTGGACC 
b VC KPQGGGGGGGGTYSCHF 

This duplex was amplified in a PCR reaction using 

40 1798-18 GCA GAA GAG CCT CTC CCT GTC TCC GGG TAA 

AGG TGG AGG TGG TGG TGG AGG TAC TTA 
CTC T 



and 



179 8-19 CTA ATT GGA TCC ACG AGA TTA ACC ACC 

CTG CGG TTT GCA A 
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as the sense and antisense primers (SEQ ID NOS: 396 and 397, 
respectively). 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers 

5 

1216-52 AAC ATA AGT ACC TGT AGG ATC G 

179 8-17 AGA GTA AGT ACC TCC ACC ACC ACC TCC ACC TTT ACC CGG 

AGA CAG GGA GAG GCT CTT CTG C 

10 

which are SEQ ID NOS: 398 and 399, respectively. The oligonucleotides 
1798-17 and 1798-18 contain an overlap of 61 nucleotides, allowing the two 
genes to be fused together in the correct reading frame by combining the 
above PCR products in a third reaction using the outside primers, 1216-52 

15 and 1798-19. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xbal and BamHL and then ligated 
into the vector p AMG21 (described below), also digested with Xbal and 
Bam HL Ligated DNA was transformed into competent host cells of E. coli 

2 0 strain 2596 (GM221, described herein). Clones were screened for the ability 
to produce the recombinant protein product and to possess the gene 
fusion having the correct nucleotide sequence. A single such clone was 
selected and designated Amgen strain #3718. 

The nucleotide and amino acid sequence of the resulting fusion 

2 5 . protein (SEQ ID NOS: 15 and 16) are shown in Figure 13. 

EMP-Fc . A DNA sequence coding for a monomer of the EPO- 
mimetic peptide fused in-frame to the Fc region of human IgGl was 
constructed using standard PCR technology. Templates for PCR reactions 
were the pFC-A3a vector and a synthetic gene encoding EPQ monomer. 

3 0 The synthetic gene for the monomer was constructed from the 4 

overlapping oligonucleotides 1798-4 and 1798-5 (above) and 1798-6 and 
1798-7 (SEQ ID NOS: 400 and 401, respectively) shown below: 
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1798-6 GGC CCG CTG ACC TGG GTA TGT AAG CCA CAA GGG GGT GGG 
GGA GGC GGG GGG TAA TCT CGA G 

5 1798-7 GAT CCT CGA GAT TAC CCC CCG CCT CCC CCA CCC CCT TGT 
GGC TTA CAT AC 

The 4 oligonucleotides were annealed to form the duplex encoding an 
amino acid sequence (SEQ ID NOS: 402 and 403, respectively) shown 
10 below: 

GTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGC 

1 + + + + - - + + 60 

GTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCG 
15 A vCKPQGGGGGGGGTYSC HFG 

CCGCTGACCTGGGTATGTAAGCCACAAGGGGGTGGGGGAGGCGGGGGGTAATCTCGAG 

61 + + + + + + - 122 

GGCGACTGGACCCATACATTCGGTGTTCCCCCACCCCCTCCGCCCCCCATTAGAGCTCCTAG 
20 A PLTWVCKPQGGGGGGG * 

This duplex was amplified in a PCR reaction using 



25 



30 



35 



1798-21 TTA TTT CAT ATG AAA GGT GGT AAC TAT TCC TGT CAT TTT 

and 

1798-22 TGG AC A TGT GTG AGT TTT GTC CCC CCC GCC TCC CCC ACC 

CCC T 

as the sense and antisense primers (SEQ ID NOS: 404 and 405, 
respectively). 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers 

1798-23 AGG GGG TGG GGG AGG CGG GGG GGA CAA AAC TCA CAC ATG 

TCC A 



and 

40 1200-54 GTT ATT GCT CAG CGG TGG CA 

which are SEQ ID NOS: 406 and 407, respectively. The oligonucleotides 
1798-22 and 1798-23 contain an overlap of 43 nucleotides, allowing the two 
genes to be fused together in the correct reading frame by combining the 
4 5 above PCR products in a third reaction using the outside primers, 1787-21 
and 1200-54. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and BamHI, and then ligated 
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into the vector pAMG21 and transformed into competent E. coli strain 
2596 cells as described above. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
5 and designated Amgen strain #3688. 

The nucleotide and amino acid sequences (SEQ ID NOS: 17 and 18) 
of the resulting fusion protein are shown in Figure 14. 

EMP-EMP-Fc . A DNA sequence coding for a dimer of the EPO- 
mimetic peptide fused in-frame to the Fc region of human IgGl was 
1 0 constructed using standard PCR technology. Templates for PCR reactions 
were the EMP-Fc plasmid from strain #3688 above and a synthetic gene 
encoding the EPO dimer. The synthetic gene for the dimer was 
constructed from the 8 overlapping oligonucleotides (SEQ ID NOS:408 to 
415, respectively) shown below: 



15 



20 



25 



30 



35 



45 



1869- 


-23 


TTT 
TAG 


TTT 
AAG 


ATC 
GAG 


GAT 
GAA 


TTG 
TAA 


ATT 
AAT 


CTA 
ATG 


GAT 


TTG 


AGT 


TTT 


AAC 


TTT 


1869 


-48 


TAA 
AA 


AAG 


TTA 


AAA 


CTC 


AAA 


TCT 


AGA 


ATC 


AAA 


TCG 


ATA 


AAA 


1871 


-72 


GGA 
GTT 


GGT 
TGC 


ACT 
AAA 


TAC 
CCG 


TCT 


TGC 


CAC 


TTC 


GGC 


CCG 


CTG 


ACT 


TGG 


1871 


-73 


AGT 
ATT 


CAG 
TTA 


CGG 
TTC 


GCC 
CTC 


GAA 
CTT 


GTG 
C 


GCA 


AGA 


GTA 


AGT 


ACC 


TCC 


CAT 


1871 


-74 


CAG 
CAT 


GGT 
TTT 


GGC 
GGC 


GGC 
CCG 


GGC 
CTG 


GGC 
ACC 


GGC 
TGG 


GGT 


GGT 


ACC 


TAT 


TCC 


TGT 


1871 


-75 


AAA 
ACC 


ATG 
CTG 


ACA 
CGG 


GGA 
TTT 


ATA 
GCA 


GGT 
AAC 


ACC 
CCA 


ACC 


GCC 


GCC 


GCC 


GCC 


GCC 


1871 


-78 


GTA 
AAA 


TGT 
ACT 


AAG 
CAC 


CCA 
ACA 


CAA 
TGT 


GGG 
CCA 


GGT 


GGG 


GGA 


GGC 


GGG 


GGG 


GAC 


1871 


-79 


AGT 
ACA 


TTT 
TAC 


GTC 
CCA 


CCC 
GGT 


CCC 
CAG 


GCC 
CGG 


TCC 
GCC 


CCC 


ACC 


CCC 


TTG 


TGG 


CTT 



4 0 The 8 oligonucleotides were annealed to form the duplex encoding an 
amino acid sequence (SEQ ID NOS: 416 and 417, respectively) shown 
below: 



TTTT1 v rATCGATTT GATTCTAGATTTGAGTTTTAACTTTTAGAAGGAGGAATAAAATATG 

1 + + + + + + 

AAAAAATAGC TAAACTAAGATCTAAACTC AAAATTGAAAATCTTCC TCCTTATTTTATAC 

M 



60 



Ml 
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GGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTGGGTTTGC AAACCGCAGGGTGGC ^ 

61 CC TCCATGAATGAG AACGGTGAAGCCGGGC GACTGAACCC AAACGTTTGGCGTCCC ACCG 
5 a GGTYSCHFGPLTWVCKPQGG 

GGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGCCCGCTGACCTGGGTATGTAAG 

121 + + + + *** + 180 

CCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCGACTGGACCCATACATTC 

10 a GGGGGGTY S C HF G P LTWVC K 

CCACAAGGGGGTGGGGGAGGCGGGGGGGACAAAACTCACACATGTCCA 

181 + " + + + 228 

GGTGTTCCCCCACCCCCTCCGCCCCCCCTGTTTTGA 

15 a pQGGGGGGGDKTHTCP 

This duplex was amplified in a PCR reaction using 1869-23 and 
1871-79 (shown above) as the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
2 0 with strain 3688 DNA using the primers 1798-23 and 1200-54 (shown 
above). 

The oligonucleotides 1871-79 and 1798-23 contain an overlap of 31 
nucleotides, allowing the two genes to be fused together in the correct 
reading frame by combining the above PCR products in a third reaction 

2 5 using the outside primers, 1869-23 and 1200-54. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xbal and BamHI, and then ligated 
into the vector pAMG21 and transformed into competent E. coli strain 
2596 cells as described for Fc-EMP. Clones were screened for ability to 

3 0 produce the recombinant protein product and possession of the gene 

fusion having the correct nucleotide sequence. A single such clone was 
selected and designated Amgen strain #3813. 

The nucleotide and amino acid sequences (SEQ ID NOS: 19 and 20, 
respectively) of the resulting fusion protein are shown in Figure 15. There 
3 5 is a silent mutation at position 145 (A to G, shown in boldface) such that 
the final construct has a different nucleotide sequence than the 
oligonucleotide 1871-72 from which it was derived. 

Fc-EMP-EMP . A DNA sequence coding for the Fc region of human 
IgGl fused in-frame to a dimer of the EPO-mimetic peptide was 
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constructed using standard PCR technology. Templates for PCR reactions 
were the plasmids from strains 3688 and 3813 above. 

The Fc portion of the molecule was generated in a PCR reaction 
with strain 3688 DNA using the primers 1216-52 and 1798-17 (shown 
5 above). The EMP dimer portion of the molecule was the product of a 

second PCR reaction with strain 3813 DNA using the primers 1798-18 (also 
shown above) and SEQ ID NO: 418, shown below: 

179 8-20 CTA ATT GGA TCC TCG AGA TTA ACC CCC TTG TGG CTT ACAT 

The oligonucleotides 1798-17 and 1798-18 contain an overlap of 61 
nucleotides, allowing the two genes to be fused together in the correct 
reading frame by combining the above PCR products in a third reaction 
using the outside primers, 1216-52 and 1798-20. 
1 5 The final PCR gene product (the full length fusion gene) was 

digested with restriction endonucleases Xba l and BamHI, and then ligated 
into the vector pAMG21 and transformed into competent E. coli strain 
2596 cells as described for Fc-EMP. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
2 0 having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #3822. 

The nucleotide and amino acid sequences (SEQ ID NOS: — and — , 
respectively) of the fusion protein are shown in Figure 16. 

Characterization of Fc-EMP activity . Characterization was carried 

2 5 out in vivo as follows. 

Mice: Normal female BDF1 approximately 10-12 weeks of age. 

Bleed schedule: Ten mice per group treated on day 0, two groups 
started 4 days apart for a total of 20 mice per group. Five mice bled at 
each time point, mice were bled a maximum of three times a week. Mice 

3 0 were anesthetized with isoflurane and a total volume of 140-160 ml of 

blood was obtained by puncture of the orbital sinus. Blood was counted 

1 0 
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on a Technicon HIE blood analyzer running software for murine blood. 
Parameters measured were WBC, RBC, HOT, HGB, PUT, NEUT, LYMPH. 

Treatments: Mice were either injected subcutaneously for a bolus 
treatment or implanted with 7 day micro-osmotic pumps for continuous 
5 delivery. Subcutaneous injections were delivered in a volume of 0.2 ml. 
Osmotic pumps were inserted into a subcutaneous incision made in the 
skin between the scapulae of anesthetized mice. Compounds were diluted 
in PBS with 0.1% BSA. All experiments included one control group, 
labeled "carrier" that were treated with this diluent only. The 
1 0 concentration of the test articles in the pumps was adjusted so that the 

calibrated flow rate from the pumps gave the treatment levels indicated in 
the graphs. 

Experiments: Various Fc-conjugated EPO mimetic peptides (EMPs) 
were delivered to mice as a single bolus injection at a dose of 100 |ig/kg. 
1 5 Fc-EMPs were delivered to mice in 7-day micro-osmotic pumps. The 

pumps were not replaced at the end of 7 days. Mice were bled until day 
51 when HGB and HCT returned to baseline levels. 

Example 4 
TNF-oc inhibitors 

2 o Fc-TNF-a inhibitors . A DNA sequence coding for the Fc region of 

human IgGl fused in-frame to a monomer of the TNF-ct inhibitory peptide 
was constructed using standard PCR technology. The Fc and 5 glycine 
linker portion of the molecule was generated in a PCR reaction with DNA 
from the Fc-EMP fusion strain #3718 (see Example 3) using the sense 

2 5 primer 1216-52 and the antisense primer 2295-89 (SEQ ID NOS: 1112 and 
1113 , respectively). The nucleotides encoding the TNF-ct inhibitory 
peptide were provided by the PCR primer 2295-89 shown^elow: 



30 



1216-52 
2295-89 



AAC ATA AGT ACC TGT AGG ATC G 

CCG CGG ATC CAT TAC GGA CGG TGA CCC AGA GAG GTG TTT TTG TAG 
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TGC GGC AGG AAG TCA CCA CCA CCT CCA CCT TTA CCC 

The oligonucleotide 2295-89 overlaps the glycine linker and Fc portion of 
the template by 22 nucleotides, with the PCR resulting in the two genes 

5 being fused together in the correct reading frame. 

The PCR gene product (the full length fusion gene) was digested 
with restriction endonucleases Nde l and BamHI, and then ligated into the 
vector p AMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 

0 produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4544. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1055 and 
1056) of the fusion protein are shown in Figures 19A and 19B. 

5 TNF-a inhibitor-Fc . A DNA sequence coding for a TNF-a inhibitory 

peptide fused in-frame to the Fc region of human IgGl was constructed 
using standard PCR technology. The template for the PCR reaction was a 
plasmid containing an unrelated peptide fused via a five glycine linker to 
Fc. The nucleotides encoding the TNF-a inhibitory peptide were 

0 provided by the sense PCR primer 2295-88, with primer 1200-54 serving as 
the antisense primer (SEQ ID NOS: 1117 and 407, respectively). The 
primer sequences are shown below: 

2295-88 GAA TAA CAT ATG GAC TTC CTG CCG CAC TAC AAA AAC ACC TCT CTG 

5 CAC CGT CCG GGT GGA GGC GGT GGG GAC AAA ACT 



1200-54 GTT ATT GCT CAG CGG TGG CA 

0 

The oligonucleotide 2295-88 overlaps the glycine linker and Fc portion c 
the template by 24 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 

la- 
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The PCR gene product (the full length fusion gene) was digested 
with restriction endonucleases Nde l and Bam HI, and then ligated into the 
vector p AMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
5 produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4543. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1057 and 1058) of 
the fusion protein are shown in Figures 20A and 20B. 

10 Expression in E. coli . Cultures of each of the pAMG21-Fc-fusion 

constructs in E. coli GM221 were grown at 37 °C in Luria Broth medium 
containing 50 mg/ml kanamycin. Induction of gene product expression 
from the luxPR promoter was achieved following the addition of the 
synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to the 

1 5 culture media to a final concentration of 20 ng/ml. Cultures were 
incubated at 37 °C for a further 3 hours. After 3 hours, the bacterial 
cultures were examined by microscopy for the presence of inclusion 
bodies and were then collected by centrifugation. Refractile inclusion 
bodies were observed in induced cultures indicating that the Fc-fusions 

2 0 were most likely produced in the insoluble fraction in E. coli . Cell pellets 
were lysed directly by resuspension in Laemmli sample buffer containing 
10% p-mercaptoethanol and were analyzed by SDS-PAGE. In each case, an 
intense coomassie-stained band of the appropriate molecular weight was 
observed on an SDS-PAGE gel. 

25 Purification of Fc-peptide fusion proteins . Cells are broken in water 

(1/10) by high pressure homogenization (2 passes at 14,000 PSI) and 
inclusion bodies are harvested by centrifugation (4200 RPM in J-6B for 1 
hour). Inclusion bodies are solubilized in 6M guanidine, 50mM Tris, 8mM 
DTT, pH 8.7 for 1 hour at a 1/10 ratio. The solubilized mixture is diluted 
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20 times into 2M urea, 50 mM tris, 160mM arginine, 3mM cysteine, pH 8.5. 
The mixture is stirred overnight in the cold and then concentrated about 
10 fold by ultafiltration. It is then diluted 3 fold with lOmM Tris, 1.5M 
urea, pH 9. The pH of this mixture is then adjusted to pH 5 with acetic 
5 acid. The precipitate is removed by centrifugation and the supernatant is 
loaded onto a SP-Sepharose Fast Flow column equilibrated in 20mM 
NaAc, 100 mM NaCl, pH 5 (lOmg/ml protein load, room temperature). 
The protein is eluted from the column using a 20 column volume gradient 
in the same buffer ranging from lOOmM NaCl to 500mM NaCl. The pool 

1 0 from the column is diluted 3 fold and loaded onto a SP-Sepharose HP 

column in 20mM NaAc, 150mM NaCl, pH 5(10mg/ml protein load, room 
temperature). The protein is eluted using a 20 column volume gradient in 
the same buffer ranging from 150mM NaCl to 400mM NaCl. The peak is 
pooled and filtered. 

15 Characterization of activity of Fc-TNF-a inhibitor and TNF-oc 

inhibitor -Fc . Binding of these peptide fusion proteins to TNF- a can be 
characterized by BIAcore by methods available to one of ordinary skill in 
the art who is armed with the teachings of the present specification. 

Example 5 

20 IL-1 Antagonists 

Fc-IL-1 antagonist . A DNA sequence coding for the Fc region of 
human IgGl fused in-frame to a monomer of an IL-1 antagonist peptide 
was constructed using standard PCR technology. The Fc and 5 glycine 
linker portion of the molecule was generated in a PCR reaction with DNA 

2 5 from the Fc-EMP fusion strain #3718 (see Example 3) using the sense 

primer 1216-52 and the antisense primer 2269-70 (SEQ ID NOS: 1112 and 
1118, respectively). The nucleotides encoding the IL-1 antagonist peptide - 
were provided by the PCR primer 2269-70 shown below: 

m 
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1216-52 AAC ATA AGT ACC TGT AGG ATC G 

2269-70 CCG CGG ATC CAT TAC AGC GGC AGA GCG TAC GGC TGC CAG TAA CCC 

GGG GTC CAT TCG AAA CCA CCA CCT CCA CCT TTA CCC 

5 

The oligonucleotide 2269-70 overlaps the glycine linker and Fc portion of 
the template by 22 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 

1 o The PCR gene product (the full length fusion gene) was digested 

with restriction endonucleases Nde l and BamHI, and then ligated into the 
vector p AMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
1 5 having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4506. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1059 and 
1060) of the fusion protein are shown in Figures 21 A and 21B. 

IL-1 antaponist-Fc . A DNA sequence coding for an IL-1 antagonist 

2 0 peptide fused in-frame to the Fc region of human IgGl was constructed 

using standard PCR technology. The template for the PCR reaction was a 
plasmid containing an unrelated peptide fused via a five glycine linker to 
Fc. The nucleotides encoding the IL-1 antagonist peptide were provided 
by the sense PCR primer 2269-69, with primer 1200-54 serving as the 
2 5 antisense primer (SEQ ID NOS: 1119 and 407, respectively). The primer 
sequences are shown below: 



2269-69 GAA TAA CAT ATG TTC GAA TGG ACC CCG GGT TAC TGG CAG CCG TAC GCT 

30 CTG CCG CTG GGT GGA GGC GGT GGG GAC AAA ACT 

1200-54 GTT ATT GCT CAG CGG TGG CA 
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The oligonucleotide 2269-69 overlaps the glycine linker and Fc portion of 
the template by 24 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 
5 The PCR gene product (the full length fusion gene) was digested 

with restriction endonucleases Nde l and Bam HL and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 

10 having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4505. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1061 and 
1062) of the fusion protein are shown in Figures 22A and 22B. Expression 
and purification were carried out as in previous examples. 

15 Characterization of Fc-IL-1 antagonist peptide and IL-1 antagonist 

peptide-Fc activity . IL-1 Receptor Binding competition between IL-lp, IL- 
1RA and Fc-conjugated IL-1 peptide sequences was carried out using the 
IGEN system. Reactions contained 0.4 nM biotin-IL-lR + 15 nM IL-l-TAG 
+ 3 uM competitor + 20 ug/ml streptavidin-conjugate beads, where 

20 competitors were IL-1RA, Fc-IL-1 antagonist, IL-1 antagonist-Fc). 

Competition was assayed over a range of competitor concentrations from 
3 uM to 1.5 pM. The results are shown in Table C below: 



II? 
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Table C— Results from IL-1 Receptor Binding Competition Assay 



IL-1pep-Fc Fc-IL-1pep IL-1ra 

5 Kl 281.5 59.58 1.405 

EC50 530.0 112.2 2.645 

95% Confidence Intervals 

10 EC50 280.2 to 1002 54.75 to 229.8 1.1 49 to 

6.086 



15 



Kl 148.9 to 532.5 29.08 to 122.1 0.6106 to 

3.233 

Goodness of Fit 

R2 0.9790 0.9687 0.9602 
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Example 6 
VEGF-Antagonists 
Fc-VEGF Antagonist . A DNA sequence coding for the Fc region of 
5 human IgGl fused in-frame to a monomer of the VEGF mimetic peptide 
was constructed using standard PCR technology. The templates for the 
PCR reaction were the pFc-A3 plasmid and a synthetic VEGF mimetic 
peptide gene. The synthetic gene was assembled by annealing the 
following two oligonucleotides primer (SEQ ID NOS: 1120 and 1121, 
10 respectively): 

2293-11 GTT GAA CCG AAC TGT GAC ATC CAT GTT ATG TGG GAA TGG GAA 

TGT TTT GAA CGT CTG 

2293-12 CAG ACG TTC AAA AC A TTC CCA TTC CCA CAT AAC ATG GAT GTC 

15 . ACA GTT CGG TTC AAC 

The two oligonucleotides anneal to form the following duplex encoding 
an amino acid sequence shown below (SEQ ID NOS 1122 ): 

20 

GTTGAACCGAACTGTGACATCCATGTTATGTGGGAATGGGAATGTTTTGAACGTCTG 

1 + + + + ---- + ! 

CAACTTGGCTTGACACTGTAGGTACAATACACCCTTACCCTTACAAAACTTGCAGAC 

25 a VEPNCDIHVMWEWECFERL 

This duplex was amplified in a PCR reaction using 2293-05 and 2293-06 as 
the sense and antisense primers (SEQ ID NOS. 1125 and 1126). 
3 0 The Fc portion of the molecule was generated in a PCR reaction 

with the pFc-A3 plasmid using the primers 2293-03 and 2293-04 as the 
sense and antisense primers (SEQ ID NOS. 1123 and 1124 r respectively). . 
The full length fusion gene was obtained from a third PCR reaction using 
the outside primers 2293-03 and 2293-06. These primers are shown below: 
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2293-03 ATT TGA TTC TAG AAG GAG GAA TAA CAT ATG GAC AAA ACT CAC 

ACA TGT 

5 2293-04 GTC ACA GTT CGG TTC AAC ACC ACC ACC ACC ACC TTT ACC CGG 

AGA CAG GGA 

2293-05 TCC CTG TCT CCG GGT AAA GGT GGT GGT GGT GGT GTT GAA CCG 

AAC TGT GAC ATC 



10 



2293-06 CCG CGG ATC CTC GAG TTA CAG ACG TTC AAA ACA TTC CCA 



The PCR gene product (the full length fusion gene) was digested 
with restriction endonucleases Ndel and BamHI, and then ligated into the 

1 5 vector p AMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4523. 

2 0 The nucleotide and amino acid sequences (SEQ ID NOS: 1063 and 

1 064) of the fusion protein are shown in Figures 23 A and 23B. 

VEGF antagonist -Fc . A DNA sequence coding for a VEGF mimetic 
peptide fused in-frame to the Fc region of human IgGl was constructed 
using standard PCR technology. The templates for the PCR reaction were 

2 5 the pFc-A3 plasmid and the synthetic VEGF mimetic peptide gene 

described above. The synthetic duplex was amplified in a PCR reaction 
using 2293-07 and 2293-08 as the sense and antisense primers (SEQ ID 
NOS. 1 127 and 1 128, respectively). 

The Fc portion of the molecule was generated in a PCR reaction 

3 0 with the pFc-A3 plasmid using the primers 2293-09 and 2293-10 as the 

sense and antisense primers (SEQ ID NOS. 1129 and 1130, respectively). 
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The full length fusion gene was obtained from a third PCR reaction using 
the outside primers 2293-07 and 2293-10. These primers are shown below: 

2293-07 ATT TGA TTC TAG AAG GAG GAA TAA CAT ATG GTT GAA CCG AAC 

5 TGT GAC 

2293-08 AC A TGT GTG AGT TTT GTC ACC ACC ACC ACC ACC CAG ACG TTC 

AAA ACA TTC 

10 -2293-09 GAA TGT TTT GAA CGT CTG GGT GGT GGT GGT GGT GAC AAA ACT 

CAC ACA TGT 

2293-10 CCG CGG ATC CTC GAG TTA TTT ACC CGG AGA CAG GGA GAG 

The PCR gene product (the full length fusion gene) was digested 
1 5 with restriction endonucleases Nde l and BamHI, and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 

2 0 and designated Amgen strain #4524. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1065 and 
1066) of the fusion protein are shown in Figures 24A and 24B. Expression 
and purification were carried out as in previous examples. 

25 Example 7 

MMP Inhibitors 
Fr-MMP inhibitor . A DNA sequence coding for the Fc region of 
human IgGl fused in-frame to a monomer of an MMP inhibitory peptide 
was constructed using standard PCR technology. The Fc and 5 glycine 

3 0 linker portion of the molecule was generated in a PCR reaction with DNA 

from the Fc-TNF-a inhibitor fusion strain #4544 (see Example 4) using the 
sense primer 1216-52 and the antisense primer 2308-67 (SEQ ID NOS: 1112 

1*3 
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and 1131, respectively). The nucleotides encoding the MMP inhibitor 
peptide were provided by the PCR primer 2308-67 shown below: 

1216-52 AAC ATA AGT ACC TGT AGG ATC G 

5 2308-67 CCG CGG ATC CAT TAG CAC AGG GTG AAA CCC CAG TGG GTG GTG 

CAA CCA CCA CCT CCA CCT TTA CCC 

The oligonucleotide 2308-67 overlaps the glycine linker and Fc portion of 
1 0 the template by 22 nucleotides, with the PCR resulting in the two genes 

being fused together in the correct reading frame. 

The PCR gene product (the full length fusion gene) was digested 

with restriction endonucleases Ndel and BamHI, and then ligated into the 

vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
1 5 described for EMP-Fc herein. Clones were screened for the ability to 

produce the recombinant protein product and to possess the gene fusion 

having the correct nucleotide sequence. A single such clone was selected 

and designated Amgen strain #4597. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1067 and 
2 0 1068) of the fusion protein are shown in Figures 25A and 25B. Expression 

and purification were carried out as in previous examples. 

MMP Inhibitor-Fc . A DNA sequence coding for an MMP inhibitory 

peptide fused in-frame to the Fc region of human IgGl was constructed 

using standard PCR technology. The Fc and 5 glycine linker portion of the 

2 5 molecule was generated in a PCR reaction with DNA from the Fc-TNF-a 

inhibitor fusion strain #4543 (see Example 4). The nucleotides encoding 
the MMP inhibitory peptide were provided by the sense PCR primer 2308- 
66, with primer 1200-54 serving as the antisense primer (SEQ ID NOS: 
1132 and 407, respectively). The primer sequences are shown below: 

30 

2308-66 GAA TAA CAT ATG TGC ACC ACC CAC TGG GGT TTC ACC CTG TGC 

GGT GGA GGC GGT GGG GAC AAA 

3 5 1200-54 GTT ATT GCT CAG CGG TGG CA 

13*1 
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The oligonucleotide 2269-69 overlaps the glycine linker and Fc portion of 
the template by 24 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 
5 The PCR gene product (the full length fusion gene) was digested 

with restriction endonucleases Ndel and BamHI, and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
1 0 having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4598. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1069 and 

1070) of the fusion protein are shown in Figures 26A and 26B. 

* * * 

1 5 The invention now being fully described, it will be apparent to one 

of ordinary skill in the art that many changes and modifications can be 
made thereto, without departing from the spirit and scope of the invention 
as set forth herein. 



2 o Abbreviations 

Abbreviations used throughout this specification are as defined 
below, unless otherwise defined in specific circumstances. 

Ac acetyl (used to refer to acetylated residues) 

AcBpa acetylated p-benzoyl-L-phenylalanine 
25 ADCC antibody-dependent cellular cytotoxicity 

Aib aminoisobutyric acid 

bA beta-alartine ~ 

Bpa p-benzoyl-L-phenylalanine 
BrAc bromoacetyl (BrCH 2 C(0) 
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10 



15 



20 



25 



BSA Bovine serum albumin 

Bzl Benzyl 

Cap Caproic acid 

CTL Cytotoxic T lymphocytes 

CTLA4 Cytotoxic T lymphocyte antigen 4 

DARC Duffy blood group antigen receptor 

DCC Dicylcohexylcarbodiimide 

Dde l-(4 / 4-dimethyl-2,6-dioxo-cyclohexylidene)ethyl 

EMP Erythropoietin-mimetic peptide 

ESI-MS Electron spray ionization mass spectrometry 

EPO Erythropoietin 

Fmoc fluorenylmethoxycarbonyl 

G-CSF Granulocyte colony stimulating factor 

GH Growth hormone 

HCT hematocrit 

HGB hemoglobin 

hGH Human growth hormone 

HOBt 1-Hydroxybenzotriazole 

HPLC high performance liquid chromatography 

IL interleukin 

IL-R interleukin receptor 

IL-1R interleukin-1 receptor 

IL-lra interleukin-1 receptor antagonist 

Lau Laurie acid 

LPS lipopolysaccharide 

LYMPH lymphocytes 

MALDI-MS Matrix-assisted laser desorption ionization.mass 

spectrometry 

Me methyl 

m 
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MeO methoxy 

MHC major histocompatibility complex 

MMP matrix metalloproteinase 

MMPI matrix metalloproteinase inhibitor 

5 1 -Nap 1 -napthylalanine 

NEUT neutrophils 

NGF nerve growth factor 

Nle norleucine 

NMP N-methyl-2-pyrrolidinone 

1 o PAGE polyacrylamide gel electrophoresis 

PBS Phosphate-buffered saline 

Pbf 2 / 2 / 4 / 6 / 7-pendamethyldihydrobenzofuran-5-sulfonyl 

PCR polymerase chain reaction 

Pec pipecolic acid 

15 PEG Polyethylene glycol) 

pGlu pyroglutamic acid 

Pic picolinic acid 

PLT platelets 

pY phosphotyrosine 

2 0 RBC red blood cells 

RBS ribosome binding site 

RT room temperature (25 °C) 

Sar sarcosine 

SDS sodium dodecyl sulfate 

2 5 STK serine-threonine kinases 

t-Boc tert-Butoxycarbonyl 

tBu tert-Butyl — 

TGF tissue growth factor 

THF thymic humoral factor 

m 



WO 00/24782 



PCT/US99/25044 



TK tyrosine kinase 

TMP Thrombopoietin-mimetic peptide 

TNF Tissue necrosis factor 

TPO Thrombopoietin 

5 TRAIL TNF-related apoptosis-inducing ligand 

Trt trityl 

UK urokinase 

UKR urokinase receptor 

VEGF vascular endothelial cell growth factor 

1 o VIP vasoactive intestinal peptide 

WBC white blood cells 
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What is claimed is: 

1 . A composition of matter of the formula 

and multimers thereof, wherein: 
5 F 1 is an Fc domain; 

X 1 and X 2 are each independently selected from -(LVP 1 , - 

<L 4 ) r P 4 

P 1 , P 2 , P 3 , and P 4 are each independently sequences of 

1 o pharmacologically active peptides; 

L 1 , V, V, and L 4 are each independently linkers; and 
a, b, c, d, e, and f are each independently 0 or 1, provided 
that at least one of a and b is 1. 

2. The composition of matter of Claim 1 of the formulae 
is X'-F 1 

or 

f'-x 2 . 

3. The composition of matter of Claim 1 of the formula 

2 0 4. The composition of matter of Claim 1 of the formula 

5. the composition of matter of Claim 1 wherein F 1 is an IgG Fc 
domain. 

6. The composition of matter of Claim 1 wherein F 1 is an IgGl Fc 
2 5 domain. 

7. The composition of matter of Claim 1 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

8. The composition of matter of Claim 1 wherein X 1 and X 2 comprise 
an IL-1 antagonist peptide sequence. 
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9. The composition of matter of Claim 8 wherein the IL-1 antagonist 
peptide sequence is selected from SEQ ID NOS: 212, 907, 908, 909, 
910, 917, and 979. 

10. The composition of matter of Claim 8 wherein the IL-1 antagonist 
5 peptide sequence is selected from SEQ ID NOS: 213 to 271, 671 to 

906, 911 to 916, and 918 to 1023. 

1 1 . The composition of matter of Claim 8 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

12. The composition of matter of Claim 1 wherein X 1 and X 2 comprise 
1 0 an EPO-mimetic peptide sequence. 

13. The composition of matter of Claim 12 wherein the EPO-mimetic 
peptide sequence is selected from Table 5. 

14. The composition of matter of Claim 12 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

1 5 15. The composition of matter of Claim 12 comprising a sequence 

selected from SEQ ID NOS: 83, 84, 85, 124, 419, 420, 421, and 461. . 

16. The composition of matter of claim 12 comprising a sequence 
selected from SEQ ID NOS: 339 and 340. 

17. The composition of matter of Claim 12 comprising a sequence 
2 0 selected from SEQ ID NOS: 20 and 22. 

18. The composition of matter of Claim 3 wherein P 1 is a TPO-mimetic 
peptide sequence. 

19. The composition of matter of Claim 18 wherein P 1 is a TPO-mimetic 
peptide sequence selected from Table 6. 

2 5 20. The composition of matter of Claim 18 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

21. The composition of matter of Claim 18 having a sequence selected 
from SEQ ID NOS: 6 and 12. 

22. A DNA encoding a composition of matter of any of Claims 1 to 21. 

m 



An expression vector comprising the DNA of Claim 22. 

A host cell comprising the expression vector of Claim 23. 

The cell of Claim 24, wherein the cell is an E. coli cell. 

A process for preparing a pharmacologically active compound, 

which comprises 

a) selecting at least one randomized peptide that modulates the 
activity of a protein of interest; and 

b) preparing a pharmacologic agent comprising at least one Fc 
domain covalently linked to at least one amino acid sequence 
of the selected peptide or peptides. 

The process of Claim 26, wherein the peptide is selected in a process 
comprising screening of a phage display library, an E. coli display 
library, a ribosomal library, or a chemical peptide library. 
The process of Claim 26, wherein the preparation of the 
pharmacologic agent is carried out by: 

a) preparing a gene construct comprising a nucleic acid 
sequence encoding the selected peptide and a nucleic acid 
sequence encoding an Fc domain; and 

b) expressing the gene construct. 

The process of Claim 26, wherein the gene construct is expressed in 
an E. coli cell. 

The process of Claim 26, wherein the protein of interest is a cell 
surface receptor. 

The process of Claim 26, wherein the protein of interest has a linear 
epitope. 

The process of Claim 26, wherein the protein of interest is a 

cytokine receptor. ■•■ — 

The process of Claim 26, wherein the peptide is an EPO-mimetic 

peptide. 
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34. The process of Claim 26, wherein the peptide is a TPO-mimetic 
peptide. 

35. The process of Claim 26, wherein the peptide is an IL-1 antagonist 
peptide. 

5 36. The process of Claim 26, wherein the peptide is an MMP inhibitor 
peptide or a VEGF antagonist peptide. 

37. The process of Claim 26, wherein the peptide is a TNF-antagonist 
peptide. 

38. The process of Claim 26, wherein the peptide is a CTLA4-mimetic 
10 peptide. 

39. The process of Claim 26, wherein the peptide is selected from 
Tables 4 to 20. 

40. The process of Claim 26, wherein the selection of the peptide is 
carried out by a process comprising: 

15 a) preparing a gene construct comprising a nucleic acid 

sequence encoding a first selected peptide and a nucleic acid 
sequence encoding an Fc domain; 
b) conducting a polymerase chain reaction using the gene 
construct and mutagenic primers, wherein 
20 i) a first mutagenic primer comprises a nucleic acid 

sequence complementary to a sequence at or near the 
5' end of a coding strand of the gene construct, and 
ii) a second mutagenic primer comprises a nucleic acid 
sequence complementary to the 3' end of the 
2 5 noncoding strand of the gene construct. 

41. The process of Claim 26, wherein the compound is derivatized. 

42. The process of Claim 26, wherein the derivatized compound 
comprises a cyclic portion, a cross-linking site, a non-peptidyl 
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linkage, an N-terminal replacement, a C-terminal replacement, or a 
modified amino acid moiety. 
43. The process of Claim 26 wherein the Fc domain is an IgG Fc 
domain. 

5 44. The process of Claim 26, wherein the vehicle is an IgGl Fc domain. 

45. i The process of Claim 26, wherein the vehicle comprises the 

sequence of SEQ ID NO: 2. 

46. The process of Claim 26, wherein the compound prepared is of the 
formula 

io (XVFMX 2 ), 
and multimers thereof,wherein: 
F 1 is an Fc domain; 

X 1 and X 2 are each independently selected from -(V) c -P\ - 
(L l ) c -P l -(L 2 ) d -P 2 , <L%-P^LX-F 2 -(L\-P\ and -(L l ) c -P 1 -(L 2 ) d -P 2 -(L 3 ) c -P 3 - 
15 (L 4 ),-P 4 

P 1 , P 2 , P 3 , and P 4 are each independently sequences of 
pharmacologically active peptides; 

L 1 , L 2 , L 3 , and L 4 are each independently linkers; and 
a, b, c, d, e, and f are each independently 0 or 1, provided 
2 0 that at least one of a and b is 1. 

47. The process of Claim 46, wherein the compound prepared is of the 
formulae 

X'-F 1 



25 



or 

-1 \/2 



48. The process of Claim 46, wherein the compound prepared is of the 
formulae 
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or 

49. The process of Claim 46, wherein F 1 is an IgG Fc domain. 

50. The process of Claim 46, wherein F 1 is an IgGl Fc domain. 

51. The process of Claim 46, wherein F 1 comprises the sequence of SEQ 
ID NO: 2. 
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ATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCA 

1 + + + + + + 60 

TACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGT 

MDKTHTC PPC PAPELLGGPS 

GTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCGCTGAGGTC 

CAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAG 

V FLF PPKPKDTLMISRTPEV 

ACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTG 

121 + -f + + + + 180 

TGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCAC 

TCVVVDVSHEDPEVKFNWYV 

GACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACG 

181 + + + + + + 240 

CTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGC 

DGVEVHNAKTKPREEQYNST 

TACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTAC 

241 + + + +' + + 300 

ATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATG 

Y, RVVSVLTVLHQDWLNGKEY 

AAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCC 

301 ---- + + + + + * 360 

TTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGG 

KCKVSNKALPAPIEKTI SKA 

AAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACC 

361 ---- + + + + + + 420 

TTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGG 

KGQ P RE PQVYT L P P S RD E LT 

AAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTG 

421 + + + + + + 480 

TTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCAC 

K NQVSLTCLVKGFYPSDIAV 

GAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGAC 

481 ---- + + + + + + 540 

CTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTG 

EWESNGQPENNYKTTPPVLD 

TCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAG 

541 + ....4.-- + f...-.-. + - -...----+ 600 

AGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTC , 

S DGS FF LYS KLTVDKS RWQQ 
GGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAG 

gni ...j + + + + + + 660 

CCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTC 

GNVFSC SVMHEALHNHYTQK 

AGCCTCTCCCTGTCTCCGGGTAAA 

661 ---- + + 684 

TC GG AG AGGGAC AG AGGCCC ATTT 
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FIG. 7 



TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

1 + + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

MDKTHTC P- 

CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

61 + + + + + + 120 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
PCPAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 - - - + + + + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMI SRT PEVTCVVVDV3- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 + + + + + + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HEDPEVKFNWYVDGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 + + + + + ♦ 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREEQYNSTYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 + + + + + + 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
VLHQDWLN'GKEYKCKVSNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 + + + + + + 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGT^ 

LPAPI EKTI 3 KAKGQPREPQ- 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 - - - - + + + + + + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTLPP3RDELTKNQ .V3LTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 + + + + + + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
LVKGFY PSDIAVE'WESMGQP* 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 + + + + + + 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
ENNYKTTPPVLDSDGS FFLY- 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 ♦ + + + + + S6Q 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
S KLTV D K S RWQQGNV F S C S V - 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 + + + + + - + 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
M HEALHNH YTQ K 3 L S h S PGK- 

AAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCTT 

72i - + + + + + + 780 

TTCCACCTCCACCACCATAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGAA 
G GGGG I EG PT L R Q W L# A A RA *- 

BamHI 
I 

AATCTCGAGGATCC 

781 + 794 

TTAGAGCTCCTAGG 
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TC T AG ATTTGTTTT AAC T AATT AAAGGAGG AAT AAC AT ATGG AC AAAACTC AC AC ATGTC 

1 + + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

MDKTHTCP- 

CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

61 + + + + + + 120 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
PCPAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 - + + + + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMI S R T PEVTC VVVDVS- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 - + + + + ♦ + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HED PEVKFNWYVDGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 I + + + + + + 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREEQYNSTYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 - + + + + + + 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
VLHQDWLNGKEYKCKVSNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 - + + ♦ + * ♦ 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTG 
LPAPIEKTI SKAKGQPREPQ- 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 - + + + + + + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTLPPSRDELTKNQVSLTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 - ♦ ♦ + + + ♦ 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
LVKGFY PSDIAVEWESNGQP- 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 + + + + + + 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
ENNYKTTPPVLDSDGSFFLY- 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 - + + + + + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
SKLTVDKSRWQQGNVFSCSV- 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 + + + + + + 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
MHEALHNHYTQKSLSLS PGK- 

AAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCTG 

721 - + + + + : + " + 780 

TTCCACCTCCACCACCATAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGAC 

GGGGG I EG PT L RQWLAARAG- 

GTGGTGGAGGTGGCGGCGGAGGTATTGAGGGCCCAACCCTTCGCCAATGGCTTGCAGCAC 

781 - + + * + + * 840 

CACCACCTCCACCGCCGCCTCCATAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGTG 

GGGGGGGI EG PT LRQWLAAR- 

BamHI 
I 

GCGCATAATCTCGAGGATCCG 

841 + «" 861 

CGCGTATTAGAGCTCCTAGGC 

c A * - 
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TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGATCGAAGGTCCGACTCTGC 

1 + + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACTAGCTTCCAGGCTGAGACG 

MIEGPTLR- 

GTCAGTGGCTGGCTGCTCGTGCTGGCGGTGGTGGCGGAGGGGGTGGCATTGAGGGCCCAA 

61 + + + + + + 120 

CAGTCACCGACCGACGAGCACGACCGCCACCACCGCCTCCCCCACCGTAACTCCCGGGTT 
QWliAARAGGGGGGGG I EG P T - 

CCCTTCGCCAATGGCTTGCAGCACGCGCAGGGGGAGGCGGTGGGGACAAAACTCACACAT 

12 1 + + + + ♦ + 180 

GGGAAGCGGTTACCGAACGTCGTGCGCGTCCCCCTCCGCCACCCCTGTTTTGAGTGTGTA 
LRQWLAARAGGGGGD KTHTC- 

GTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAA 

181 + + ♦ + + 240 

CAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTT 

p PC PAPELLGGPSVFLFPPK- 

AACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACG 

2 41 . + + + + ♦ + 300 

TTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGC 
PKDTLMISRTPEVTCVVVDV- 

TGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATA 

301 + + + + + + 360 

ACTCGGTG^TTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTAT 
SHEDPEVKFNWYVDGVEVHN- 

ATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCC 

361 - + + + + + " + 420 

TACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGG 

A K T K PREEQYNSTYRVV SVL- 

TCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACA 

421 . - + + + + + + 480 

AGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGT 
TVLHQDWLNGKEYKCKVSNK- 

AAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAAC 

481 + + + + + + 540 

TTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTG 

ALPAPIEKTISKAKGQPREP* 

C ACAGGTGT AC ACCCTGCCQCC ATCCCGGG ATG AGCTG ACC AAGAACCAGGTC AGCCTGA ^ 

541 GTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACT 

QVYTLPPSRDELTKNQVSLT- 

CCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGC 

601 + + + + + + 660 

GGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCG 

CLVKGFYPSDIAVEWESNGO- 

AGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCC 

661 + + + + + + 720 

TCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGG 

PENNYKTTPPVLDSDGSFFIi- 



TCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCT 

AGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAA'QAGTACGA 
YSKLTVDKSRWQQGNVFSCS 

CCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGG 



781 + + * - 840 

GGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCC 
VMHEALHNHYTQKSLSLS P G - 

BamHI 
I 

GTAAATAATGGATCC 

841 + 855 

C ATTT ATT AC CT AGG 
K * 
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FIG. 10 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGATCGAAGGTCCGACTCTGC 

x + + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACTAGCTTCCAGGCTGAGACG 

MIEGPTLR- 

GTCAGTGGCTGGCTGCTCGTGCTGGTGGAGGCGGTGGGGACAAAACTCACACATGTCCAC 

61 + + + + + + I 20 

CAGTCACCGACCGACGAGCACGACCACCTCCGCCACCCCTGTTTTGAGTGTGTACAGGTG 

QWLAARAGGGGGDKTHTCPP- 

CTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCA 

121 + + + + + + 180 

GAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGT 

C PAPELLGGPSVFLFPPKPK- 

AGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCC 

181 + + + * + * 240 

TCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGG 

DTLMISRTPEVTCVVVDVSH- 

ACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCA 

241 + + + + + : + 300 

TGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTACGGT 

ED P EVK'FNWYVD, GVEVHNAK- 

AGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCG 

301 + + + + + + 360 

TCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGC 

TKPREEQYNSTYRVVSVLTV- 

TCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCC 

2£i + + + ♦ + • * * + 420 

AGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGG 
LHQDWLNGKEYKCKVSNKAL- 

TCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGG 

421 + + + + + *** 480 

AGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCC 

PA P I EKTI S KAKGQ PRE PQV- 

TGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCC 

481 + + + + + + 540 

ACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGG 

YTLPPSRDELTKNQVSLTCL- 

TGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGG 
54^ _ _ _ + + + •--•-*•♦•----»-•-•"♦■ 600 

ACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCC 
VKGFYPSDIA VEWESNGQPE- 

AGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACA 

£Q\ -i- + + + + + 660 

TCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGATGT 
NNYKTT PPVLDSDGSFFLY S- 

GCAAGCTCACCGTGGACAAGAGC AGGTGGC AGC AGGGGAACGTCTTCTC ATGCTCCGTGA 

- - + + _ . . - + . + + + 720 

— CGTTCGAGTGGCACCTGTTCTCGTCC ACCGTCGTCCCCTTGC AGAAGAGTACGAGGC ACT 
KLTVDKS RWQQGNVFSC S V M - 

TGCATGAGGCTCTGC ACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAAT 

721 + + + + + + 780 

ACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTA 

HEALHNHYTQKS LS LS PGK * - 

BamHX 
I 

AATGGATCC 
781 789 
TTACCTAGG 
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Xbal 

TCT AG ATTTGTTTT AAC T AATT AAAGGAGGAAT AAC AT ATGGAC AAAACTC AC AC ATGTC 

1 + + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

KD KTHTCP- 
CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

Si - + + + + + + 120 

GTGG AAC AGGTC G AGGC C TTG AGG AC C C C C C T GGC AGTC AG AAGG AG AAGGGGGGTTTTG 
PC PAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 - + + + + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMISRT PEVTC VVVDVS- 

GC C AC G AAG AC C C TG AGGTC AAGTTC AACTGGT AC GTGG AC GGC GTGG AGGTGC AT AATG 

181 - + + + + + + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HED PEVKF NWYVDGVEVHNA* 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 + + + + + + 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREEQYNSTYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 ♦ + + + - - - - + + 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
VliHQDWLNGKEYKCKVSNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 + * + + + + 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCG^ 

L PA PI EKTI SKAKGQPRE P Q - 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 - + + + + + + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTLPPSRDELTKNQVSLTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 ♦ + + + + + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 

LVKGFY PSDIAVEWE SNGQ P - 
CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 + + + + + + 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 

ENNYKTTPPVLDSDGSFFIiY- 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 + + + + * + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 

S KLTVDKSRWQQGNVFSCSV- 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 ---------■+■--------- + - .-.-- + 720 

ACTACGT ACTC CGAG ACGTGTTGGTGATGTGC GTCTTCTCGG AGAGGGAC AGAGGCCC AT 
MHEALHNHYTQKSLSLS P G K - 

AAGGTGGAGGTGGTGGTGGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTGGGTTT 

721 + + + ♦ + + 780 

TTCCACCTCCACCACCACCTCCATGAATGAGAACGGTGAAGCCGGGCGACTGAACCCAAA 

GGGGGGGTY S C H'FG PLTWVC* 

BamHI 
I 

GCAAACCGCAGGGTGGTTAATCTCGTGGATCC 

7 81 + + + • " 812 

CGTTTGGCGTCCCACCAATTAGAGCACCTAGG 
K P Q G G * 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24782 



14/37 



PCT/US99/25044 



FIG. 14 



Xbal 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGGAGGTACTTACTCTTGCC 

X - - + + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCCTCCATGAATGAGAACGG 

MGGTYSCH- 

ACTTCGGCCCGCTGACTTGGGTATGTAAGCCACAAGOGGGTGGGGGAGGCGGGGGGC3ACA 

TGAAGCCGGGCGACTGAACCCATACATTCGGTGTTCCCCCACCCCCTCCGCCCCCCCTGT 
F G P LTWVC K PQGGGGGGGDK* 

AAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCC 

121 + + + + + - + 180 

TTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGG 
THTC P PC PAPELLGGPSVFL- 

TCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCG 

181 - + + + + + + 240 

AGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGC 
F PPKPKDTLMI SRTPEVTCV- 

TGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCG 

241 + - - + + + + + 300 

ACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGC 
VVDVS HE D PEVKFNWYVDGV- 

TGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTG 

301 — + + + * * + 360 

ACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCAC 

EVHNAKTKPREEQYNSTYRV- 

TGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCA 

361 + - + + + + + 420 

ACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGT 
VSVLTVLHQDWLMGKEY-KCK- 



AGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGC 

TCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTC 
VSNKALPAPI EKTI 3 KAKG Q 

AGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACC 

TCGG^TCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACT^ 

PRE PQVYTLP PSRDELTKNQ 

AGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGG 

TCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCC 
V3LTCLVKGFYPSDIAVEWE 

AGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACG 



601 + + + - + 660 

TCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGC 
S N G Q PENNYKTT P PVODSDG- 

GCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACG 

661 + + + + + + 720 

CGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGC 
3 F F LY 3 KL'TV D K 3 RWQ Q G-NV- 

TCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACT AC ACGCAGAAGAGCCTCT 

721 + + + + + + 780 

AGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGA 

r S CSVMHEALHNHYTQKSLS- 

BamHI 
I 

CCCTGTCTCCGGGTAAATAATGGATCC 

781 + + 807 

GGGACAGAGGCCCATTTATTACCTAGG 

L 3 P G K * 
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Xbal 

TCTAGATTTGAGTTTTAACTTTTAGAAGGAGGAATAAAATA 

I + + + + + + 60 

AGATCTAAACTCAAAATTGAAAATCTTCCTCCTTATTTTATACCCTCCATGAATGAGAAC 

M G G T Y S C 

CCACTTCGGCCCACTGACTTGGGTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGG 

61 + + + + + + 120 

GGTGAAGCCGGGTGACTGAACCCAAACGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACC 
H F G P LTWVC K PQGGGGGG'GG - 

TACCTATTCCTGTCATTTTGGCCCGCTGACCTGGGTATGTAAGCCACAAGGGGGTGGGGG 

12 i + + + + + + 180 

ATGGATAAGGACAGTAAAACCGGGCGACTGGACCCATACATTCGGTGTTCCCCCACCCCC 
TYSCHFGPLTWVCKPQGGGG - 

AGGCGGGGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGG 

181 + ♦ + + + + 240 

TCCGCCCCCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCC 

GGGDKTHTCPPC PAPELLGG 

ACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCC 

241 + + + + + 300 

TGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGG 
PSVFLFPPKPKDTLMISRTP 

TGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTG 

301. + + + + + + 360 

ACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGAC 

EVTCVVVDVSHEDPEVKFNW - 

GTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAA 

361 + + + + + + 420 

CATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTT 

YVDGVEVHNAKTKPREEQYN- 

CAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAA 

421 + + + ♦ + + 480 

GTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTT 

STYRVVSVLTVLHQDWLNGK - 

GGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTC 

481 + + * + + + 540 

CCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAG 

EYKCKVSNKALPAPIEKTIS - 

CAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGA 
541 --------- + -- -"---"*- + ~ — ---.-»+--- — _.... + -..-.--..- + ■.•.■.-----« + 600 

GTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACT 
KAKGQPREPQVYTLPP3RDE - 



GCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACAT 

CGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTC 
LTKNQVSLTCLVKGFYPSD I 

CGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGT 



661 + + ♦ r - ---•+ 720 

GCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCA 
AVEWESNGQ PENNYKTT PPV 

GCTGGACTCCGACGGCTCCTTCTTCCTCTAC AGC AAGCTC ACCGTGGAC AAGAGCAGGTG 

721 + + + + - + + 780 

CGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCAC 
LDSDGSPFLYSKLTVDK3RW- 

GC AGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACAC 

781 + + + + + + 840 

CGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTG 
OQGNVFSCSVMHEALHNHYT- 

BamHI 
I 

GCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 



CGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG 
Q KSLSLSPGK* 
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FIG. 16 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

1 ♦ + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 
c MD KTHTCP- 

CACCTTG^CCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTCCCCC 

61 + + + + + -•-- + 120 

GTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTTC 
C PCPAPELLGGPSVFLPPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 + + + + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
C KDTLMISRTPEVTCV VVDVS- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 + + + + + + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
C H ED P EVKFNWYVDGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 + + + + + + 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
C KTKPREEQYNSTYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 + + + + + 360 

GGC AGGACGTGGTCCTGACC G ACTT AC C GTTC CTC ATGTTC AC GTTCC AGAGGTTGTTTC 
C VLHQD'WLMGKEYKCKVSNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 + + + + + + 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTC 
c LPAPIEKTISKAKGQPREPQ- 

AGGTGTACACCCTGCCTCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 + - + + + + + 480 

TCCACATGTGGGACGGAGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
C VYTLPPSRDELTKNQVSLTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 + + + + + + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
C LVKGFYPSD IAVEWESNGQP- 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 - + + ♦ + + + 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
c ENNYKTTP PVLD SDGS P F L Y - 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 - + + + + + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
C SKLTVDKSRWQQGNVFSCSV- 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 - ♦ + + + + + 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
C MHEALHNHYTQKSLSLSPGK- 

AAGGTGGAGGTGGTGGCGGAGGTACTTACTCTTGCCACTTCGGCCCACTGACTTGGGTTT 

721 - + + + + + + 780 

TTCCACCTCCACCACCGCCTCCATGAATGAGAACGGTGAAGCCGGGTGACTGAACCCAAA 
C GGGGGGG'TY SCHFGPL T " W V C - 

GCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGCCCGC 

781 - + + + + + + 840 

CGTTTGGCGTC CC AC C GC C GC CGCC GC C GC C AC C ATGG AT AAGG AC AGT AAAACCGGGC G 
C K PQGGGGGGGGTY SCHFG PL- 

BamHI 
I 

TGACCTGGGTATGTAAGCCACAAGGGGGTTAATCTCGAGGATCC 

841 - + + + + 884 

ACTGGACCCATACATTCGGTGTTCCCCCAATTAGAGCTCCTAGG 
C TWVCKPQGG* 
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tAatH sticky end] 
(position #4358 in pAMG21) 



5 • GCGTAACGTATGCATGGTCTCC - 

3' TGCACGCATTGCATACGTACC AGAGG - 



- CCATGCGAGAGTAGGGAACTGCCAGGCATCAAATAAAACGAAAGGCTCAGTCGAAAGACT - 

- GGTACGCTCTCATCCCTTGACGGTCCGTAGTTTATTTTGCTTTCCGAGTCAGCTTTCTGA - 

- GGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCCTGAGTAGGACAAATCCGC - 

- CCCGGAAAGCAAAATAGACAACAAACAGCCACTTGCGAGAGGACTCATCCTGTTTAGGCG - 

- CGGGAGCGGATTTGAACGTTGCGAAGCAACGGCCCGGAGGGTGGCGGGCAGGACGCCCGC - 

- GCCCTCGCCTAAACTTGCAACGCTTCGTTGCCGGGCCTCCCACCGCCCGTCCTGCGGGCG - 

- C ATAAACTGCCAGGCATCAAATTAAGCAGAAGGCCATCCTGACGGATGGCCTTTTTGCGT - 

- GTATTTGACGGTCCGTAGTTTAATTCGTCTTCCGGTAGGACTGCCTACCGGAAAAACGCA - 

- TTC T AC AAAC TC TTTTGTTT ATTTTTC T AAAT AC ATTC AAATATGG AC GTCGT AC TT AAC - 

- AAGATGTTTGAGAAAACAAATAAAAAGATTTATGTAAGTTTATACCTGCAGCATGAATTG - 

- TTTTAAAGTATGGGCAATCAATTGCTCCTGTTAAAATTGCTTTAGAAATACTTTGGCAGC - 

- AAAATTTCATACCCGTTAGTTAACGAGGACAATTTTAACGAAATCTTTATGAAACCGTCG - 

- GGTTTGTTGTATTGAGTTTCATTTGCGCATTGGTTAAATGGAAAGTGACCGTGCGCTTAC * 

- CCAAACAACATAACTCAAAGTAAACGCGTAACCAATTTACCTTTCACTGGCACGCGAATG - 

- TACAGCCTAATATTTTTGAAATATCCCAAGAGCTTTTTCCTTCGCATGCCCACGCTAAAC - 

- ATGTCGGATTATAAAAACTTTATAGGGTTCTCGAAAAAGGAAGCGTACGGGTGCGATTTG - 

- ATTCTTTTTCTCTTTTGGTTAAATCGTTGTTTGATTTATTATTTGCTATATTTATTTTTC - 

- T AAGAAAAAGAGAAAACC AATTT AGC AAC AAAC T AAATAAT AAAC G AT ATAAAT AAAAAG - 

- GATAATTATCAACTAGAGAAGGAACAATTAATGGTATGTTC ATAC ACGCATGTAAAAATA - 

- CTATTAATAGTTGATCTCTTCCTTGTTAATTACCATACAAGTATGTGCGTACATTTTTAT - 

- AACTATCTATATAGTTGTCTTTCTCTGAATGTGCAAAACTAAGCATTCCGAAGCCATTAT - 

- TTGATAGATATATCAACAGAAAGAGACTTACACGTTTTGATTCGTAAGGCTTCGGTAATA - 

- T AGC AGT ATGAAT AGGG AAAC T AAACCC AGTG AT AAG ACC TG ATG ATTTCGC TTC TTT AA - 

- ATCGTCATACTTATCCCTTTGATTTGGGTCACTATTCTGGACTACTAAAGCGAAGAAATT - 

- TT AC ATTTGGAGATTTTTTATTTAC AGCATTGTTTTC AAATATATTCCAATTAATCGGTG - 

- AATGTAAACCTCTAAAAAATAAATGTCGTAACAAAAGTTTATATAAGGTTAATTAGCCAC - 

- AATGATTGGAGTTAGAATAATCTACTATAGGATC ATATTTTATTAAATTAGCGTCATCAT - 

- TTACTAACCTCAATCTTATTAGATGATATCCTAGTATAAAATAATTTAATCGCAGTAGTA - 

- AATATTGCCTCCATTTTTTAGGGTAATTATCC AGAATTGAAATATCAGATTTAACCATAG - 

- TTATAACGGAGGTAAAAAATCCCATTAATAGGTCTTAACTTTATAGTCTAAATTGGTATC - 

- AATGAGGATAAATGATCGCGAGTAAATAATATTCACAATQTACCATTTTAGTC ATATCAG - 

- TTACTCCTATTTACTAGCGCTCATTTATTATAAGTGTTACATGGTAAAATCAGTATAGTC - 

- ATAAGCATTGATTAATATCATTATTGCTTCTACAGGCTTTAATTTTATTAATTATTCTGT - 

- T ATTC GT AACTAATT AT AGT AAT AAC G AAG ATGTC C GAAATT AAAAT AATT AAT AAGAC A - 

- AAGTGTCGTCGGCATTTATGTCTTTCATACCCATCTCTTTATCCTTACCTATTGTTTGTC - 

- TTC AC AGC AGCCGTAAATAC AGAAAGTATGGGTAGAGAAATAGGAATGGATAAC AAACAG - 

- GC AAGTTTTGCGTGTTATATATCATTAAAACGGTAATAGATTGAC ATTTGATTCTAATAA - 

- CGTTCAAAACGCACAATATATAGTAATTTTGCCATTATCTAACTGTAAACTAAGATTATT - 
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FIG. 17B 



ATTGGATTTTTGTCACACTATTATATCGCTTGAAATACAATTGTTTAACATAAGTACCTG 
TAACCTAAAAACAGTGTGATAATATAGCGAACTTTATGTTAACAAATTGTATTCATGGAC 

TAGGATCGTACAGGTTTACGCAAGAAAATGGTTTGTTATAGTCGATTAATCGATTTGATT 
ATCCTAGCATGTCCAAATGCGTTCTTTTACCAAACAATATCAGCTAATTAGCTAAACTAA 

CTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGA 
GATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGCT 

GCTCACTAGTGTCGACCTGCAGGGTACCATGGAAGCTTACTCGAGGATCCGCGGAAAGAA 
CGAGTGATCACAGCTGGACGTCCCATGGTACCTTCGAATGAGCTCCTAGGCGCCTTTCTT 

GAAGAAGAAGAAGAAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATA 
CTTCTTCTTCTTCTTTCGGGCTTTCCTTCGACTCAACCGACGACGGTGGCGACTCGTTAT 

ACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGG 
TGATCGTATTGGGGAACCCCGGAGATTTGCCCAGAACTCCCCAAAAAACGACTTTCCTCC 

AACCGCTCTTCACGCTCTTCACGC 3 • [SacII sticky end] 

TTGGCGAGAAGTGCGAGAAGTG 5* (position #5904 in pAMG21) 
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FIG. 19A 

Ndel 1 1 

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG ^ ^ 

1 GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGG 

MDKTHTCPPCPAPEL L G G P - 

TC AGTCTTCCTCTTCCCCCC AAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG ^ ^ q 
1 AG^CAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

S VFLFPPKPKDTLMISRTPE - 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACT^ ^ g q 

21 cagtgtacgSaccaccacctgcactcggtccttctgggactccagttca^^ 

V TCVVVDVSHEDPEVKFN WY - 

gtggacggcgtggaggtgcataatgccaagac^ 24 q 

1 8 1 CACCTGCCGCACCTCCACGTATTACGOT 

V D GVEVHNAKTKPREEQYNS - 
ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGG^ ^ 

241 tgcatggcaJaccagtcgcaggagtggcag^acgtg^^ 

T YRVVSVLTVLHQDWLNG KE 

tacaagtgcaaggtctccaacaaagc^ 3 g o 

301 ^TGTTCACGiTCCAGAGGTiGTTTCGGGAJGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

Y KCKVSNKALPAPIEKTISK - 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGG ^ ^ q 

361 CGGTTTCCCGTrcGGGGCTCTTGGTGTCCACATGTGGGACGGC^ 

A.KGQPREPQVYTLPPSRDEL - 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAG^ 4 g Q 

421 tggttcttgStccagtcggactggacggaJcagtttccgaagatag^ 

T KNQVSLTCLVKGFYPSDIA - 

gtggagtgggagagcaatgggcagccggagaacaactacaagaccacg^^ 54q 
481 cacctcaccJtctcgwacJcgtcggcctcttgttgatgttctggtgcggagggcacgac 

VEWESNG QPENNYKTTP-PVL-- 

gactccgacggctccttcttcctctacagcaagctcaccgtggacaagagcaggtggcag 
541 ctgaggctgccgaggaagaIggagatgtcgttcgagtggcacctgttctcgtccaccgtc 
dsdgsfflyskltvdksrwq - 
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FIG. 19B 

CAGGGGAACGTCTTCTC ATGCTCCGTGATGCATGAGGCTCTGCACAACC ACTACACGCAG 

fin i + + + + + + 660 

GTCCCCTTGC AGAAGAGTACGAGGC ACT ACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALH NHYTQ 
AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTGACTTCCTGCCGCACTAC 
661 TTCTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCACTGAAGGACGGCGTGATG 
K S L S Ij.S PGKG GGGGD F L PHY 

BamHI 
I 

AAAAACACCTCTCTGGGTC ACCGTCCGTAATGG ATCC 

721 + + + 757 

TTTTTGTGGAGAGACCCAGTGGCAGGCATTACCTAGG 

KNTSLGHRP* 
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FIG. 20A 

Ndel 

CATATGGACTTCCTGCCGCACTACAAA^ 60 

1 gtatacctga^ggacggcgJgatgtttttgtggagagacccagtggcaggcccacctccg 

MDFLPHYKNTSL GHRPGGG - 

ggtggggacaaaactcacacatgtccaccttgcccagcacctgaactcctggggggaccg 
61 cc^cccctg^ttgagtgtStacaggtggaacgggtcgtggacttgaggacccccctggc 
g gdkthtcppcpapellggp - 
tcagttttcctcttccccccaaaacccaaggacaccctcatgat^ i8Q 
121 agtcaaaagSagaagggggSttttgggttcctgtgggagtactagag^c 

S VFLFPP. KPKDTLMISRTPE - 

gtcacatgcgtggtggtggacgtgagccacgaagaccctg^ 24 o 

181 cagtgtacgcaccaccacctgcactcggtgcttctgggactccagttcaagttgac 

VTC VVVDVSHEDPEVKFNWY - 

gtggacggcgtggaggtgcataatgccaagac^ 3oq 

24 1 CACCTGCCGCACCTCCACG^ ATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEV-HN AKTKPREEQYNS - 
ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGG^AAGGAG ^ 

301 tgcatggcacaccagtcgcIggag^ggcaggacgtggtcctgaccgacttaccgttcctc 

TYRVVSVLTVLHQDWLNGKE 

tacaagtgcaaggtctccaacaaagccctcccagcccccatcgagaaaaccatctcca^ 
361 atgttcacg^ccagaggtJgtttcgggagggtcgggggtagctcttttggtagaggttt 

y KCKVSNKALPAPlEKTlSK - 

ggcaaagggcagccccgagaaccacaggtgtacaccctgccccc^tcccgg^atgagctg 
4 2 1 cggtttcccgtcggggctcttggtgtccacatgtgggacgggggtagggccctactcgac 
x'kgqpRBPQvytlppsrdel - 

.ACCAAGAACCAGGTCAGCCTGAC^^ ^ 
4 8 1 TGGTT CTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKMQVSLTCLVKGPYPSDIA - 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCT^ ^ 

541 CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VBWBSNGQPBNMYKTTPPVL - 
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FIG. 20B 



GACT.CCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

g 01 + + + + + + 660 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ 

CAGGGGAACGTCTTCTC ATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG ^ 

6 6 1 GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 



Q G N V F S 



C SVMHEALHNHYTQ 



BamHI 
I 

AAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCCGCGG 

721 + + + + " ?61 

TTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGGCGCC 

KSLSLSPGK* 
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FIG. 21A 

Ndel 

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGG gQ 

1 GTATACCTGTTTTGAGTGTC^ 

MDKTHTCPPCPAPELLGGP - 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGA^ ^ 

6 1 AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAG 

SVFLFPPKPKDTLMISRTPE - 
GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAAC^ ^ 
121 CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY - 
GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTAC^ ^ 

181 cacctgccgJacctccacg^attacggttctgtttcggcgccctcctcgtcatgt^^ 

VDGVEVHNAKTKPREEQYNS - 

acgtaccgtgtggtcagcgtcctcaccgtcctgcaccaggactggctg^ 3qo 

241 tgcatggcac^ccagtcgcag^ 

t yrvvsvltvlhqdwlngke - 

tacaagtgcaaggtctccaacaaagccctcccagcccccatcgagaaa^ 3 6 o 

301 AT GTTCACG?TCCAGAGGTiGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

ykckvsnkalpap iektisk - 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACA^ 42Q 
361 + " * * 1 1 111-H^ G TCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

KG QPREPQV VTLPPSRDEL - 
ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGG^TTCTATC ^ 

421 + * ' " " "'HI- Itggacggaccagtttccgaagatagggtcgctgtagcgg 

KNQVSLTCLVKGFYPS DIA - 

gtggagtgggagagcaatgggcagccggagaacaactacaagacc^ s4 o 

4 81 + * lilim.ZZ CTCTTGTTGA TGTTCTGGTGCGGAGGGCACGAC 



CGGTTTCCCGTCGGGGCTCTTGGT 
A 

1 a a Annr^TTrTATCCCAGCGACATCGCC 

— - - r- 

TGGTTCTTGGTCCAGTCGGAC1 
T 

, .m^»nnr.pi aTnnnr AHCCGGAGAAC AACTA' 

.+.---- - . — - 

CACCTCACCCTCTCGTTACCCGTCGGCC1 
VEWESNGQPENNYKTTPPVL - 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG ^ 

541 CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

D S D G S F F L Y S K L T V D K S R W Q - 

SUBSTITUTE SHEET (RULE 26) 



PCT/US99/25044 

WO 00/24782 27/37 



FIG. 21B 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTG^ ^ 
601 gtCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGN VFSCSVMHEALHNHYTQ - 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGT^ 720 
6 6 1 TTCTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCAAAGCTTACCTGGGGCCCA 

K .SLSLSPGKGGGOGPBWTPO - 

BamHI 

TACTGGCAGCCGTACGCTCTGCCGCTGTAATGGATCCCTCGAG ^ 

721 atgaccgtcSgcatgcgagacggcgacattacctagggagctc 
ywq'pyalpl* 
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FIG. 22A 

Ndel 

CATATGTTCGAATGGACCCCGGGTTACTGGCAGCCGTACGCTCTGCCGCTGGGTGGAGGC 

+ - + + * - - H + 

GTATACAAGCTTACCTGGGGCCCAATGACCGTCGGCATGCGAGACGGCGACCCACCTCCG 

MFEWTPGYWQPYALPLGGG 
GGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCG 

ccacccctgttttgagtgtgtacaggtggaacgggtcgtggacttgaggacccccctggc 
g gdkthtc ppcpapellggp 



TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

^21 . . 4. - + .-4---.-----*-4--------»- + -- -- -- -- --f 180 

AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

18 1 + + + + + + 240 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEV KFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAAC AGC ^ 

241 CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

301 + + + + + + 360 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 



T Y R V 



VSVLTVLHQDWLNGKE 



TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

3gi + + + + + + 420 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YK CKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

421 + + + + + + 480 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC ^ 

481 TG^TTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG _ 
TKNQVSLTCLVKGFYPSDIA 
GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG ^ ^ ^ 

541 c ACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 



V E W E S 



NGQPENNYKTTPPVL 
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FIG. 22B 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

601 ---- + + + + + + 660 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

661 • --------+----- - + - + + . + .._..----+ 720 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 

BamHI 
I 

AAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

721 ---- + + + 757 

TTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG 

KSLSLSPGK* 
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FIG. 23A 



Ndel 

CATATGGACAAAACTCACACATGTCCACCGTGCCCAGCACCTGAACTCCTGGGGGGACCG 

! + + + + + : + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGCACGGGTCGTGGACTTGAGGACCCCCCTGGC 
MDKTHTCPPCPAPELLGGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 ---- + '- + + + + + 120 

AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE. 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

12 i- - + + + + + + 18° 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 ■ + + + + + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKP. REEQ YNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 ---- + + + + + + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLN GKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCC AAAGGGCAGCCCCGAGAACCACAGGTGTACACCCtGCCCCCATCCCGGGATGAGCTG 

361 + + + + + + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + + + + + + 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 



T K N Q 



VSLTCLVKGFYPSDIA 



GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 ..»;. + + + + +- + 540 

CACGTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTAC AGC AAGCTC ACCGTGGAC AAGAGC AGGTGGCAG 

c 41 ...- + + + + + + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ - 
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FIG. 23B 



CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

601 + + + + + + 66 ° 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 



QGNVFSCSVM-HEALHNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGTGGTGGTGGTGTTGAACCGAACTGTGAC 

+ + + - _ - - + -----f + 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCACCACCACCACAACTTGGCTTGACACTG 

KSLSLS PGKGGGGGVEPNCD 

BamHI 
I 

ATCCATGTTATGTGGGAATGGGAATGTTTTGAACGTCTGTAACTCGAGGATCC 

+ + + + + 773 

TAGGTACAATACACCCTTACCCTTACAAAACTTGCAGACATTGAGCTCCTAGG 

IHVMWEWECFERL* 
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FIG.24A 

Ndel 

I 

CATATGGTTGAACCGAACTGTGACATCCATGTTATGTGGGAATGGGAATGTTTTGAACGT 



1 - - - + + + + + + 60 

GTATACCAACTTGGCTTGACACTGTAGGTACAATACACCCTTACCCTTACAAAACTTGCA 

MVE PNCD IHVMWEWECFER 

CTGGGTGGTGGTGGTGGTGACAAAACTCACACATGTCCACCGTGCCCAGCACCTGAACTC 

61 + + + + + + 120 

GACCCACCACCACCACCACTGTTTTGAGTGTGTACAGGTGGCACGGGTCGTGGACTTGAG 

LGGGGGDKTHTCPPCPAPEt, 

CTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCC 

121 + + + + + + 180 

GACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGG 

L-GGPSVFLFPPKPKDTLMIS 

CGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAG 

181 + + + + + + 240 

GCGTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTC 

R TPEVTCVVVDVSHEDPEVK 

TTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAG 

241 + + + + + + 300 

AAGTTGACCATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTC 

FNWYVDGVEVHNAKTKPREE 

CAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTG 

301 + + + + + + 360 

GTC.ATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGAC 

QYNSTYRVVSVLTVLHQDWL 

AATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAA 

361 + + + + + + 420 

TTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTT 



NGKEYKCKVSNKALPAPIEK 

ACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCC 

421 + + + + + + 480 

TGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGG 

TISKAKGQPREPQVYTLPPS 

CGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGT-CAAAGGCTTCTATCCC ^ . 

GCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGG 

RDELTKNQVSLTCLVKGFYP 

AGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACG 

541 + + + + + + 600 

TCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGC 

SDIAVEWESNGQPENNYKTT 
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FIG. 24B 



CCTGCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAG 

601 + + + + + + 660 

GGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTC 

PPVLDSDGS FFLYSK'LTVDK 

AGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAAC 

661 + + + + + + 720 

TCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTG 

SRWQQGNVFSCSVMHEALHN 

BamHI 
I 

CACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAACTCGAGGATCC 

721 + + + + + 773 

GTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTGAGCTCCTAGG 

HYTQKSLSLSPGK* 
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FIG. 25A 



Ndel 



CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 

1 + + + + + + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 + + + + + + 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTGACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 + + + + + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

V DGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 --- + + + + + + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

T.YRVVSVLTVLHQD-WLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

Y KCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

361 + + + + + + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGOGGGTAGGGCCCTACTCGAC 

A K G Q PRE PQVY TL P PS RD EL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + + + + + + 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLTCLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 + + + + + : + 5- 40 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWE SNGQPENNYKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 + + + + + + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ 
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FIG. 25B 



CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

601 + + - + + + + 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

Q.GNV F S C S VMH E ALHNH Y TQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTTGCACCACCCACTGGGGT 

661 + + + + + + 720 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCAACGTGGTGGGTGACCCCA 

K SLSLSPG K GGGGGCTTHWG 

BamHI 
I 

TTCACCCTGTGCTAATGGATCCCTCGAG 

721 + + 748 

AAGTGGGACACGATTACCTAGGGAGCTC 



F T L C 



* 
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FIG. 26A 



Ndel 

CATATGTGCACCACCCACTGGGGTTTCACCCTGTGCGGTGGAGGCGGTGGGGACAAAGGT 

2_ f-- -4--. — - h - gQ 

GTATACACGTGGTGGGTGACCCCAAAGTGGGACACGCCACCTCCGCCACCCCTGTTTCCA 

MCTTHWGFTLCGGG G GDKG 

GGAGGCGGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGG 

g2 --- - + -- .-.- + - + - .. + ..-.. + + 120 

CCTCCGCCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCC 



G G 



GGDKTHTCPPCPAPELLG 



GGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACC 
121 CCTGGC AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGG 
GPSVFLFPPKPKDTLMISRT 
CCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAAC 
181 GGACTCCAGTGTACGC ACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTtCAAGTTG 



P E V 



TCVV.VDVSHEDPEVKFN 



tggtAcgtggacggcgtggaggtgcataatgccaagacaaagccgcgggaggagcagtac 

241 + + + + + + 300 

ACGATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATG 



W Y V 



DGVEVHNAKTKPREEQY 



AACAGC ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGC 

-,ni --- + + + + + + 36 

TTGTCGTGCATGGC ACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCG 



N S T Y 



RVVSVLTVLHQDWLNG 



AAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACC ATC 

..^ + + + + + + 420 

TTGCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAG 

KEYKCKV SNK ALPAPIEKTI 

TCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGAT 

+ + + + + + 4BU 

AGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTA 

SKAKGQPREPQVYTLPPSRD 

GAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCA^GGC^CTATCCCAGCGAC ^ ^ 

481 CTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTG ' 
ELTKNQVSLTCLVKGFYPSD - 
ATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCC ^ 

54 1 TAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGG 

IAVEWESNGQPENNYKTTPP - 
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GTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGG 

601 + + + + + + 660 

CACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCC 

V LDSDGS FFLYSKLTVDKSR 

TGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTAC 

661 + + + + + + 720 

ACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATG 

WQQGNVFSCSVMHEALHNHY 

BamHI 
I 

ACGGAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

721 .+ + + + --- 763 

TGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG 

TQKSLSLSPGK* 
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